WO 97/49822 PCT/EP97/01218
GENE CONFERRING DISEASE RESISTANCE IN PLANTS AND USES THEREOF



The present invention relates to disease resistance in plants and to identifying and 
breeding disease resistance into plants. More particularly, the present invention relates to 
the identification, isolation and characterization of a gene involved in broad spectrum 
disease resistance in plants. 

Plants are constantly challenged by a wide variety of pathogenic organisms 
including viruses, bacteria, fungi, and nematodes. Crop plants are particularly vulnerable 
because they are usually grown as genetically-uniform monocultures; when disease strikes, 
losses can be severe. 

However, most plants have their own innate mechanisms of defense against
pathogenic organisms. Natural variation for resistance to plant pathogens has been 
identified by plant breeders and pathologists and bred into many crop plants. These natural 
disease resistance genes often provide high levels of resistance to or immunity against 
pathogens. 

In many plant species an initial inoculation by a necrotizing pathogen can immunize 
the plant to subsequent infection. This acquired disease resistance was first documented in 
1901 and is thought to play an important role in the preservation of plants in nature.
Particularly well characterized examples of plant immunity are the phenomenon of systemic
acquired resistance (SAR) and induced resistance in plants such as tobacco, Arabidopsis
and cucumber. In these systems, inoculation by a necrotizing pathogen results in systemic
protection against subsequent infections by that pathogen as well as a number of other
agronomically important bacterial, fungal and viral pathogens.

Systemic acquired resistance can also be triggered by chemical immunization
compounds, certain chemicals that induce the immunity response in plants. Such
compounds can be of natural origin, such as salicylic acid (SA), or can be synthetic
chemicals, such as 2,6-dichloroisonicotinic acid (INA) and benzo(1,2,3)thiadiazole-7-
carbothioic acid S-methyl ester (BTH). Treatment with a pathogen or an immunization
compound induces the expression of at least nine sets of genes in tobacco, the best
characterized species. Different numbers and types of genes can be expressed in other
plants. The level of induction for SAR-related genes induced by immunization compounds
is as high as 10,000-fold over background. In particular, SAR is characterized by the
expression of SAR genes, including the pathogenesis-related (PR) genes.

The SAR genes are induced following infection by a pathogen. Some of these
genes have a role in providing systemic acquired resistance to the plant. These plant
proteins are induced in large amounts in response to infection by various pathogens
including viruses, bacteria and fungi. PR proteins were first discovered in tobacco plants
(Nicotiana tabacum) reacting hypersensitively to infection with tobacco mosaic virus (TMV).
Subsequently, PR proteins have been found in many plant species (see Redolfi et al.
(1983) Neth. J. Plant Pathol. 89: 245-254; Van Loon (1985) Plant Mol. Biol. 4: 111-116; and
Uknes et al. (1992) Plant Cell 4: 645-656). Such proteins are believed to be a common
defensive systemic response of plants to infection by pathogens.

Pathogenesis-related proteins include but are not limited to SAR8.2a and SAR8.2b
proteins, the acidic and basic forms of tobacco PR-1a, PR-1b, and PR-1c; PR-1', PR-2, PR-2',
PR-2", PR-N, PR-O, PR-O', PR-4, PR-P, PR-Q, PR-S, and PR-R major proteins; cucumber
peroxidases; basic cucumber peroxidase; the chitinase which is a basic counterpart of PR-P
or PR-Q; the beta-1,3-glucanase (glucan endo-1,3-beta-glucosidase, EC 3.2.1.39) which is
a basic counterpart of PR-2, PR-N or PR-O; and the pathogen-inducible chitinase from
cucumber. Such PR proteins are disclosed, for example, in Uknes et al. (1992) The Plant
Cell 4: 645-656 and the references cited therein.

SAR or SAR-like genes are expressed in all plant species exhibiting systemic
acquired resistance. Expression of such genes can be determined by probing with known
SAR DNA sequences. For example, see Lawton et al. (1992) Proceedings of the Second
European Federation of Plant Pathology (1983), In: Mechanisms of Defence Responses in
Plants, B. Fritig and M. Legrand (eds), Kluwer Academic Publishers, Dordrecht, pp. 410-
420; Uknes et al. (1992) The Plant Cell 4: 645-656; and Ward et al. (1991) The Plant Cell 3:
1085-1094. Methods for hybridization and cloning are well known in the art. See, for
example, Molecular Cloning: A Laboratory Manual, 2nd Edition, Vol. 1-3, Sambrook et al.
(eds.) Cold Spring Harbor Laboratory Press (1989) and the references cited therein.

Alternatively such SAR or SAR-like genes can be found by other methods such as 
protein screening, +/- screening, etc. See, for example, Liang and Pardee (1992) Science 
257: 967-971; and St. John and Davis (1979) Cell 16: 443.

Despite much research and the use of sophisticated and intensive crop-protection 
measures, including genetic transformation of plants, losses due to disease remain in the 
billions of dollars annually. Disease resistance genes have previously been cloned but 
transgenic plants transformed with these genes would typically be resistant only to a subset 
of strains of a particular pathogen species. Despite efforts to clone genes involved in SAR, 
a gene controlling broad spectrum disease resistance has not been isolated and 
characterized. 

Several lines of evidence indicate that endogenously produced SA is involved in the 
signal transduction pathway coupling the perception of pathogen infection with the onset of 
SAR. Mutants which retain the ability to accumulate SA in response to pathogen yet have 
lost the ability to induce SAR genes or resistance after application of SA or INA have been 
described by Delaney et al., Proc. Natl. Acad. Sci. 92: 6602-6606 (1995) and in
WO94/16077, the whole of which is incorporated herein by reference.

It has now been discovered that these mutants contain a mutant gene, which gene
in its wildtype form controls SAR gene expression and SAR itself. The present invention
recognizes that the mutant gene confers broad spectrum disease susceptibility to mutant
plants and renders them noninducible to pathogens and chemical inducers.

The present invention concerns the identification, isolation and characterization of
the wildtype NIM1 gene, a gene which allows activation in a plant of SAR and SAR gene
expression in response to biological and chemical inducers.

A mutant gene has been identified in mutagenized Arabidopsis plants. These plants
have been found to be defective in their normal response to pathogen infection in that they
do not express genes associated with systemic acquired resistance (SAR) nor are they
capable of exhibiting SAR. These mutants contain a defective gene which has been
labelled nim1 (for noninducible immunity).

The present invention also concerns the use of the cloned NIM1 gene and variants
thereof to create transgenic plants that have broad spectrum disease resistance, and the
use of the NIM1 gene and variants thereof in a screening method for identifying compounds
capable of inducing broad spectrum disease resistance in plants.
BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows the effect of chemical inducers on the induction of PR gene
expression in wild-type and nim1 plants.

Figure 2 depicts PR-1 gene expression in pathogen-infected Ws-0 and nim1 plants
over the course of 6 days from the initiation of infection.

Figure 3 shows the levels of SA accumulation in Ws-0 and nim1 plants infected with
P. syringae.

Figure 4 shows the genetic map of the NIM1 region as determined by AFLP and
SSLP analysis.

Figure 5 depicts a physical map of the NIM1 region as determined by YAC clone
analysis.

Figure 6 shows a physical map of an extended P1/BAC contig.

Figure 7 shows a physical map setting forth the positions of P1 and BAC clones with
respect to the flanking AFLP markers and YACs.

Figure 8 shows a physical map of a further extended P1/BAC contig containing the
NIM1 gene.

Figure 9 shows an integrated genetic and physical fine map of the NIM1 region.

Figure 10 shows an integrated map of the NIM1 region.

Figure 11 shows an integrated map of the NIM1 region including the new AFLP
markers.

Figure 12 is a schematic representation of recombinants D169 and C105.

Figure 13 is a global map of the chromosomal region centered on NIM1 with
recombinants indicated, including BACs, YACs and cosmids in the NIM1 region.

Figure 14 provides the sequence of the 9.9 kb region of clone BAC-04 containing the
NIM1 gene.

Figure 15 shows the nucleic acid sequence of the NIM1 gene and the amino acid
sequence of the NIM1 gene product, including changes in the various alleles.

Figure 16 shows the expression of NIM1 induced by INA, BTH, SA and pathogen in
wild type and mutant alleles of nim1.

Figure 17 shows the expression of PR-1 in [...].

Figure 18 shows disease resistance in various nim1 mutants.

Figure 19 is an amino acid sequence comparison of Expressed Sequence Tag
regions of the NIM1 protein and cDNA protein products of 4 rice gene sequences (see SEQ
ID NO: 3).



DEFINITIONS

AA: Amino Acid

AFLP: Amplified Fragment Length Polymorphism

avrRpt2: avirulence gene Rpt2, isolated from Pseudomonas syringae

BAC: Bacterial Artificial Chromosome

BTH: benzo(1,2,3)thiadiazole-7-carbothioic acid S-methyl ester

Col: Arabidopsis ecotype Columbia

ECs: Enzyme combinations

INA: 2,6-dichloroisonicotinic acid

Ler: Arabidopsis ecotype Landsberg erecta

NIM1: the wildtype gene, conferring disease resistance to the plant

nim1: mutant allele of NIM1, conferring disease susceptibility to the plant

nim1: mutant plant line

ORF: open reading frame

PCs: Primer combinations

SA: salicylic acid

SAR: Systemic Acquired Resistance

SSLP: Simple Sequence Length Polymorphism

Ws-0: Arabidopsis ecotype Wassilewskija

YAC: Yeast Artificial Chromosome



The NIM1 gene has been cloned by mapping and walking techniques which indicate
that the gene is contained in a ~105 kb region (see Figure 13 and Table 16). This region
is delineated by the L84.6b marker on the left and the L84.T2 marker on the right. Only
three overlapping cosmids made from wild-type DNA from the 105 kb region complement
the nim1 mutant phenotype (Figure 13 and Table 16). These three cosmids only overlap in
a 9.9 kb region defined by the left end of cosmid clone D7 and the right end of cosmid 05
as pictured in Figure 13. Many other cosmids made to other areas of the 105 kb region do
not complement the nim1 phenotype (Figure 13 and Table 16). A near full length cDNA
clone to the NIM1 gene indicates the appropriate intron-exon borders and defines the amino
acid sequence of the gene product. Only the NIM1 gene region within the 9.9 kb
complementing region has sequence changes in various nim1 mutant alleles (Table 18).
Three other potential gene regions showed no sequence changes that are associated with
the nim1 phenotype. The sequence changes found in the NIM1 gene region are consistent
with altered function or loss of function of the gene product. The severity of the change to
the NIM1 gene region in a particular mutant allele is roughly correlated with the observed
physiological severity of that nim1 allele. Only the NIM1 gene region had detectable RNA
(transcription) and this RNA showed abundant changes consistent with the physiological
role of NIM1 in pathogenesis (Table 18 and Figure 16).
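
The narrowing of the complementing region to the segment shared by all complementing
cosmids can be illustrated with a small interval calculation. The following is only a sketch
under stated assumptions: the coordinates are invented for illustration and are not taken
from Figure 13 or Table 16, the names cosmid_2 and cosmid_3 are hypothetical placeholders,
and the helper common_overlap is not part of the original disclosure.

```python
from typing import Optional, Sequence, Tuple

Interval = Tuple[int, int]  # (start_bp, end_bp) with start_bp < end_bp


def common_overlap(intervals: Sequence[Interval]) -> Optional[Interval]:
    """Return the segment shared by every interval, or None if there is none."""
    if not intervals:
        return None
    start = max(s for s, _ in intervals)  # right-most left end
    end = min(e for _, e in intervals)    # left-most right end
    return (start, end) if start < end else None


# Hypothetical positions (bp) within the ~105 kb region; NOT values from the source.
complementing_cosmids = {
    "D7": (40_000, 75_000),        # its left end bounds the shared segment
    "cosmid_2": (20_000, 49_900),  # its right end bounds the shared segment
    "cosmid_3": (30_000, 60_000),
}

shared = common_overlap(list(complementing_cosmids.values()))
if shared is not None:
    length_kb = (shared[1] - shared[0]) / 1000
    print(f"shared segment: {shared[0]}-{shared[1]} bp ({length_kb:.1f} kb)")
```

With these invented coordinates the shared segment is bounded by the left end of one clone
and the right end of another, which mirrors how the 9.9 kb complementing region is
described above.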
The present invention relates to an isolated gene fragment, the NIM1 gene, which is
a key component of the systemic acquired resistance (SAR) pathway in plants. The NIM1
gene is associated with the activation of SAR by chemical and biological inducers and, in
conjunction with such inducers, is required for SAR and SAR gene expression.

The location of the NIM1 gene is determined by molecular biological analysis of the
genome of mutant plants known to carry the mutant nim1 gene, which gives the host plants
extreme sensitivity to a wide variety of pathogens and renders them unable to respond to
pathogens and chemical inducers of SAR.

nim1 mutants are useful as "universal disease susceptible" (UDS) plants by virtue of
their being susceptible to many strains and pathotypes of pathogens of the host plant and
also to pathogens which do not normally infect the host plant, but which infect other hosts.
They can be generated by treating seeds or other biological material with mutagenic agents
and then selecting progeny plants for the UDS phenotype by treating progeny plants with
known chemical inducers (e.g. INA) of the systemic acquired response and then infecting
the plants with a known pathogen. Noninducible mutants develop severe disease symptoms
under these circumstances, whereas non-mutants are induced by the chemical compound
to systemic acquired resistance. nim1 mutants can be equally selected from mutant
populations generated by chemical and irradiation mutagenesis, as well as populations
generated by T-DNA insertion and transposon-induced mutagenesis.

Techniques for generating mutant plant lines are well known in the art. The nim1
plant phenotype is used as a tool to identify an isolated gene fragment which allows
expression of broad spectrum disease resistance in plants.

Comprised by the present invention is an isolated DNA molecule comprising a mutant gene
of the NIM1 gene which is a nim1 gene.
Following the use of a nim1 mutant or plant to isolate the wild-type NIM1 gene
necessary for constitutive expression of SAR genes, the resistance trait, in combination with
other characteristics important for production and quality, can be incorporated into plant
lines through breeding. Breeding approaches and techniques are known in the art. See, for
example, Welsh J. R., Fundamentals of Plant Genetics and Breeding, John Wiley & Sons,
NY (1981); Crop Breeding, Wood D. R. (Ed.) American Society of Agronomy, Madison,
Wisconsin (1983); Mayo O., The Theory of Plant Breeding, Second Edition, Clarendon
Press, Oxford (1987); Singh, D.P., Breeding for Resistance to Diseases and Insect Pests,
Springer-Verlag, NY (1986); and Wricke and Weber, Quantitative Genetics and Selection
in Plant Breeding, Walter de Gruyter and Co., Berlin (1986).

A further object of the invention is a chimeric gene comprising a promoter active in plants
operably linked to a heterologous DNA molecule encoding the amino acid sequence of a
NIM1 gene product and variants thereof according to the invention.

Methodologies for the construction of plant expression cassettes as well as the
introduction of foreign DNA into plants are generally described in the art. Generally, for the
introduction of foreign DNA into plants, Ti plasmid vectors have been utilized for the delivery
of foreign DNA. Also utilized for such delivery have been direct DNA uptake, liposomes,
electroporation, micro-injection, and microprojectiles. Such methods have been published in
the art. See, for example, Bilang et al. (1991) Gene 100: 247-250; Scheid et al. (1991) Mol.
Gen. Genet. 228: 104-112; Guerche et al. (1987) Plant Science 52: 111-116; Neuhause et
al. (1987) Theor. Appl. Genet. 75: 30-36; Klein et al. (1987) Nature 327: 70-73; Howell et
al. (1980) Science 208: 1265; Horsch et al. (1985) Science 227: 1229-1231; DeBlock et al.
(1989) Plant Physiology 91: 694-701; Methods for Plant Molecular Biology (Weissbach and
Weissbach, eds.) Academic Press, Inc. (1988); and Methods in Plant Molecular Biology
(Schuler and Zielinski, eds.) Academic Press, Inc. (1989). See also U.S. patent application
Serial No. 08/438,666, filed May 10, 1995, and WO 93/07278, both of which are
incorporated herein by reference in their entirety. It is understood that the method of
transformation will depend upon the plant cell to be transformed.

It is further recognized that the components of the expression cassette may be
modified to increase expression. For example, truncated sequences, nucleotide
substitutions or other modifications may be employed. Plant cells transformed with such
modified expression systems, then, would exhibit overexpression or constitutive expression
of SAR genes necessary for activation of SAR.

The DNA molecule or gene fragment conferring disease resistance to plants by
allowing induction of SAR gene expression can be incorporated in plant or bacterial cells
using conventional recombinant DNA technology. Generally, this involves inserting the DNA
molecule into an expression system to which the DNA molecule is heterologous (i.e., not
normally present). The heterologous DNA molecule is inserted into the expression system
or vector in proper orientation and correct reading frame. The vector contains the necessary
elements for the transcription and translation of the inserted protein-coding sequences. A
large number of vector systems known in the art can be used, such as plasmids,
bacteriophage viruses and other modified viruses. Suitable vectors include, but are not
limited to, viral vectors such as lambda vector systems gt11, gt10 and Charon 4; plasmid
vectors such as pBI121, pBR322, pACYC177, pACYC184, pAR series, pKK223-3,
pUC8, pUC9, pUC18, pUC19, pLG339, pRK290, pKC37, pKC101, pCDNAII; and other
similar systems. The DNA sequences can be cloned into the vector using standard cloning
procedures in the art, as described by Maniatis et al., Molecular Cloning: A Laboratory
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York (1982).

A further object of the invention is a recombinant vector comprising the chimeric gene
according to the invention.
In order to obtain efficient expression of the gene or gene fragment of the present
invention, a promoter must be present in the expression vector. RNA polymerase normally
binds to the promoter and initiates transcription of a gene. Promoters vary in their strength,
i.e., ability to promote transcription. Depending upon the host cell system utilized, any one
of a number of suitable promoters can be used. Suitable promoters include ubiquitin, nos
promoter, the small subunit ribulose bisphosphate carboxylase gene promoter, the small
subunit chlorophyll A/B binding polypeptide promoter, the 35S promoter of cauliflower
mosaic virus, and promoters isolated from plant genes. See C.E. Vallejos, et al.,
"Localization in the Tomato Genome of DNA Restriction Fragments Containing Sequences
Homologous to the rRNA (45S), the major chlorophyll A/B Binding Polypeptide and the
Ribulose Bisphosphate Carboxylase Genes," Genetics 112: 93-105 (1986), which discloses
the small subunit materials. The nos promoter and the 35S promoter of cauliflower mosaic
virus are well known in the art.

Once the disease resistance gene of the present invention has been cloned into an 
expression system, it is ready to be transformed into a plant cell. Plant tissues suitable for 
transformation include leaf tissues, root tissues, meristems, and protoplasts. 

Bacteria from the genus Agrobacterium can be utilized to transform plant cells. 
Suitable species of such bacteria include Agrobacterium tumefaciens and Agrobacterium
rhizogenes. Agrobacterium tumefaciens (e.g., strains LBA4404 or EHA105) is particularly
useful due to its well-known ability to transform plants. 

Another approach to transforming plant cells with a gene involves propelling inert or 
biologically active particles at plant tissues and cells. This technique is disclosed in U.S. 
Patent Nos. 4,945,050; 5,036,006; and 5,100,792 all to Sanford et al. Generally, this 
procedure involves propelling inert or biologically active particles at the cells under 
conditions effective to penetrate the outer surface of the cell and afford incorporation within 
the interior thereof. When inert particles are utilized, the vector can be introduced into the
cell by coating the particles with the vector containing the desired gene. Alternatively, the
target cell can be surrounded by the vector so that the vector is carried into the cell by the
wake of the particle. Biologically active particles (e.g., dried yeast cells, dried bacterium or a
bacteriophage, each containing DNA sought to be introduced) can also be propelled into
plant cell tissue.

The isolated gene fragment of the present invention can be utilized to confer
disease resistance to a wide variety of plant cells, including those of gymnosperms,
monocots, and dicots. Although the gene can be inserted into any plant cell falling within
these broad classes, it is particularly useful in crop plant cells, such as rice, wheat, barley,
rye, corn, potato, carrot, sweet potato, sugar beet, bean, pea, chicory, lettuce, cabbage,
cauliflower, broccoli, turnip, radish, spinach, asparagus, onion, garlic, eggplant, pepper,
celery, squash, pumpkin, zucchini, cucumber, apple, pear, quince, melon, plum,
cherry, peach, nectarine, apricot, strawberry, grape, raspberry, blackberry, pineapple,
avocado, papaya, mango, banana, soybean, tobacco, tomato, sorghum and sugarcane.

The expression system of the present invention can be used to transform virtually
any crop plant cell under suitable conditions. Transformed cells can be regenerated into
whole plants such that the gene imparts disease resistance to the intact transgenic plants.
As set forth above, the expression system can be modified so that the disease resistance
gene is continuously or constitutively expressed.



Transformation 

The present system can be utilized in any plant which can be transformed and
regenerated. Such methods for transformation and regeneration are well known in the art.
As well as the above cited references, see also: An, G., Watson, B.D., and Chang, C.C.
Transformation of tobacco, tomato, potato, and Arabidopsis thaliana using a binary Ti
vector system. Plant Physiol. 81:301-305, 1986; Fry, J., Barnason, A., and Horsch, R.B.
Transformation of Brassica napus with Agrobacterium tumefaciens based vectors. Pl.Cell
Rep. 6:321-325, 1987; Block, M.d. Genotype independent leaf disc transformation of potato
(Solanum tuberosum) using Agrobacterium tumefaciens. Theor.appl.genet. 76:767-774,
1988; Deblock, M., Brouwer, D.D., and Tenning, P. Transformation of Brassica napus and
Brassica oleracea using Agrobacterium tumefaciens and the Expression of the bar and neo
genes in the transgenic plants. Plant Physiol. 91:694-701, 1989; Baribault, T.J., Skene,
K.G.M., Cain, P.A., and Scott, N.S. Transgenic grapevines: regeneration of shoots
expressing beta-glucuronidase. Pl.Cell Rep. 41:1045-1049, 1990; Hinchee, M.A.W., Newell,
C.A., ConnorWard, D.V., Armstrong, T.A., Deaton, W.R., Sato, S.S., and Rozman, R.J.
Transformation and regeneration of non-solanaceous crop plants. Stadler.Genet.Symp.
203-212, 1990; Barfield, D.G. and Pua, E.C. Gene transfer in plants of Brassica
juncea using Agrobacterium tumefaciens-mediated transformation. Pl.Cell Rep. 10:308-314,
1991; Cousins, Y.L., Lyon, B.R., and Llewellyn, D.J. Transformation of an Australian cotton
cultivar: prospects for cotton improvement through genetic engineering. Aust. J. Plant
Physiol. 18:481-494, 1991; Chee, P.P. and Slightom, J.L. Transformation of Cucumber
Tissues by Microprojectile Bombardment Identification of Plants Containing Functional and
Nonfunctional Transferred Genes. GENE 118:255-260, 1992; Christou, P., Ford, T.L., and
Kofron, M. The development of a variety-independent gene-transfer method for rice.
Trends.Biotechnol. 10:239-246, 1992; D'Halluin, K., Bossut, M., Bonne, E., Mazur, B.,
Leemans, J., and Botterman, J. Transformation of sugarbeet (Beta vulgaris L.) and
evaluation of herbicide resistance in transgenic plants. Bio/Technol. 10:309-314, 1992; Dhir,
S.K., Dhir, S., Savka, M.A., Belanger, F., Kriz, A.L., Farrand, S.K., and Widholm, J.M.
Regeneration of Transgenic Soybean (Glycine Max) Plants from Electroporated Protoplasts.
PLANT PHYSIOL 99:81-88, 1992; Ha, S.B., Wu, F.S., and Thorne, T.K. Transgenic turf-
type tall fescue (Festuca arundinacea Schreb.) plants regenerated from protoplasts. Pl.Cell
Rep. 11:601-604, 1992; Blechl, A.E. Genetic Transformation The New Tool for Wheat
Improvement 78th Annual Meeting Keynote Address. CEREAL FOOD WORLD 38:846-847,
1993; Casas, A.M., Kononowicz, A.K., Zehr, U.B., Tomes, D.T., Axtell, J.D., Butler, L.G.,
Bressan, R.A., and Hasegawa, P.M. Transgenic Sorghum Plants via Microprojectile
Bombardment. PROC NAT ACAD SCI USA 90:11212-11216, 1993; Christou, P. Philosophy
and Practice of Variety Independent Gene Transfer into Recalcitrant Crops. IN VITRO CELL
DEV BIOL-PLANT 29P:119-124, 1993; Damiani, F., Nenz, E., Paolocci, F., and Arcioni, S.
Introduction of Hygromycin Resistance in Lotus spp Through Agrobacterium Rhizogenes
Transformation. TRANSGENIC RES 2:330-335, 1993; Davies, D.R., Hamilton, J., and
Mullineaux, P. Transformation of Peas. Pl.Cell Rep. 12:180-183, 1993; Dong, J.Z. and
Mchughen, A. Transgenic Flax Plants from Agrobacterium Mediated Transformation
Incidence of Chimeric Regenerants and Inheritance of Transgenic Plants. PLANT SCI
91:139-148, 1993; Fitch, M.M.M., Manshardt, R.M., Gonsalves, D., and Slightom, J.L.
Transgenic Papaya Plants from Agrobacterium Mediated Transformation of Somatic
Embryos. Pl.Cell Rep. 12:245-249, 1993; Franklin, C. and Trieu, T.N. Transformation of
the Forage Grass Caucasian Bluestem via Biolistic Bombardment Mediated DNA Transfer.
PLANT PHYSIOL 102:167, 1993; Golovkin, M.V., Abraham, M., Morocz, S., Bottka, S.,
Feher, A., and Dudits, D. Production of Transgenic Maize Plants by Direct DNA Uptake into
Embryogenic Protoplasts. PLANT SCI 90:41-52, 1993; Guo, G.Q., Xu, Z.H., Wei, Z.M., and
Chen, H.M. Transgenic Plants Obtained from Wheat Protoplasts Transformed by Peg
Mediated Direct Gene Transfer. CHIN SCI BULL 38:2072-2078, 1993; Asano, Y. and Ugaki,
M. Transgenic plants of Agrostis alba obtained by electroporation-mediated direct gene
transfer into protoplasts. Pl.Cell Rep. 13, 1994; Ayres, N.M. and Park, W.D. Genetic
Transformation of Rice. CRIT REV PLANT SCI 13:219-239, 1994; Barcelo, P., Hagel, C.,
Becker, D., Martin, A., and Lorz, H. Transgenic Cereal (Tritordeum) Plants Obtained at High
Efficiency by Microprojectile Bombardment of Inflorescence Tissue. PLANT J 5:583-592,
1994; Becker, D., Brettschneider, R., and Lorz, H. Fertile Transgenic Wheat from
Microprojectile Bombardment of Scutellar Tissue. PLANT J 5:299-307, 1994; Biswas,
G.C.G., Iglesias, V.A., Datta, S.K., and Potrykus, I. Transgenic Indica Rice (Oryza Sativa L)
Plants Obtained by Direct Gene Transfer to Protoplasts. J BIOTECHNOL 32:1-10, 1994;
Borkowska, M., Kleczkowski, K., Klos, B., Jakubiec, J., and Wielgat, B. Transformation of
Diploid Potato with an Agrobacterium Tumefaciens Binary Vector System .1. Methodological
Approach. ACTA PHYSIOL PLANT 16:225-230, 1994; Brar, G.S., Cohen, B.A., Vick, C.L.,
and Johnson, G.W. Recovery of Transgenic Peanut (Arachis Hypogaea L) Plants from Elite
Cultivars Utilizing Accell(R) Technology. PLANT J 5:745-753, 1994; Christou, P. Genetic
Engineering of Crop Legumes and Cereals Current Status and Recent Advances. AGRO
FOOD IND HI TECH 5:17-27, 1994; Chupeau, M.C., Pautot, V., and Chupeau, Y. Recovery
of Transgenic Trees After Electroporation of Poplar Protoplasts. TRANSGENIC RES 3:13-
19, 1994; Eapen, S. and George, L. Agrobacterium Tumefaciens Mediated Gene Transfer
in Peanut (Arachis Hypogaea L). Pl.Cell Rep. 13:582-586, 1994; Hartman, C.L., Lee, L.,
Day, P.R., and Turner, N.E. Herbicide Resistant Turfgrass (Agrostis Palustris Huds) by
Biolistic Transformation. BIO-TECHNOLOGY 12:919-923, 1994; Howe, G.T., Goldfarb, B.,
and Strauss, S.H. Agrobacterium Mediated Transformation of Hybrid Poplar Suspension
Cultures and Regeneration of Transformed Plants. Plant Cell Tissue & Organ Culture
36:59-71, 1994; Konwar, B.K. Agrobacterium Tumefaciens Mediated Genetic
Transformation of Sugar Beet (Beta Vulgaris L). J PLANT BIOCHEM BIOTECHNOL 3:37-
41, 1994; Ritala, A., Aspegren, K., Kurten, U., Salmenkalliomarttila, M., Mannonen, L.,
Hannus, R., Kauppinen, V., Teeri, T.H., and Enari, T.M. Fertile Transgenic Barley by
Particle Bombardment of Immature Embryos. PLANT MOL BIOL 24:317-325, 1994; Scorza,
R., Cordts, J.M., Ramming, D.W., and Emershad, R.L. Transformation of Grape (Vitis
Vinifera L) Somatic Embryos and Regeneration of Transgenic Plants. J CELL BIOCHEM
:102, 1994; Shimamoto, K. Gene Expression in Transgenic Monocots. CURR
OPIN BIOTECHNOL 5:158-162, 1994; Spangenberg, G., Wang, Z.Y., Nagel, J., and
Potrykus, I. Protoplast Culture and Generation of Transgenic Plants in Red Fescue
(Festuca Rubra L). PLANT SCI 97:83-94, 1994; Spangenberg, G., Wang, Z.Y., Nagel, J.,
and Potrykus, I. Gene Transfer and Regeneration of Transgenic Plants in Forage Grasses.
J CELL BIOCHEM :102, 1994; Wan, Y.C. and Lemaux, P.G. Generation of Large Numbers
of Independently Transformed Fertile Barley Plants. PLANT PHYSIOL 104:37-48, 1994;
Weeks, J.T., Anderson, O.D., and Blechl, A.E. Stable Transformation of Wheat (Triticum
Aestivum L) by Microprojectile Bombardment. J CELL BIOCHEM :104, 1994; Ye, X.J.,
Brown, S.K., Scorza, R., Cordts, J., and Sanford, J.C. Genetic Transformation of Peach
Tissues by Particle Bombardment. J AMER SOC HORT SCI 119:367-373, 1994;
Spangenberg, G., Wang, Z.Y., Nagel, J., and Potrykus, I. PROTOPLAST CULTURE AND
GENERATION OF TRANSGENIC PLANTS IN RED FESCUE (FESTUCA RUBRA L). Plant
Science 97:83-94, 1995.



As nim1 host plants may also be susceptible to pathogens outside of the host-range
into which they normally fall, these plants also have significant utility in the molecular,
genetic, and biological study of host-pathogen interactions. Furthermore, the UDS
phenotype of nim1 plants also renders them of utility for fungicide screening. nim1 mutants
selected in a particular host have considerable utility for the screening of fungicides using
that host and pathogens of the host. The advantage lies in the UDS phenotype of the
mutant, which circumvents the problems encountered due to hosts being differentially
susceptible to different pathogens and pathotypes, or even resistant to some pathogens or
pathotypes.

Pathogens of the invention include but are not limited to viruses or viroids, e.g.
tobacco or cucumber mosaic virus, ringspot virus or necrosis virus, pelargonium leaf curl
virus, red clover mottle virus, tomato bushy stunt virus, and like viruses; fungi, e.g.
Phytophthora parasitica and Peronospora tabacina; bacteria, e.g. Pseudomonas syringae
and Pseudomonas tabaci; insects such as aphids, e.g. Myzus persicae; and lepidoptera,
e.g., Heliothis spp.; and nematodes, e.g., Meloidogyne incognita. The methods of the
invention are useful against a number of disease organisms of maize including but not
limited to downy mildews such as Sclerophthora macrospora, Sclerophthora rayssiae,
Sclerospora graminicola, Peronosclerospora sorghi, Peronosclerospora philippinensis,
Peronosclerospora sacchari and Peronosclerospora maydis; rusts such as Puccinia sorghi,
Puccinia polysora and Physopella zeae; other fungi such as Cercospora zeae-maydis,
Colletotrichum graminicola, Fusarium moniliforme, Gibberella zeae, Exserohilum turcicum,
Kabatiella zeae and Bipolaris maydis; and bacteria such as Erwinia stewartii.

DESCRIPTION OF THE SEQUENCE LISTING

SEQ ID NO:1 - 9919-bp genomic sequence from Figure 14. 

SEQ ID NO:2 - 5655-bp genomic sequence from Figure 15. 

SEQ ID NO:3 - AA sequence of wild-type NIM1 protein encoded by cds of SEQ ID NO:2.

SEQ ID NO:4 - Rice-1 AA sequence 33-155 from Figure 19.
SEQ ID NO:5 - Rice-1 AA sequence 215-328 from Figure 19.
SEQ ID NO:6 - Rice-2 AA sequence 33-155 from Figure 19.
SEQ ID NO:7 - Rice-2 AA sequence 208-288 from Figure 19.
SEQ ID NO:8 - Rice-3 AA sequence 33-155 from Figure 19.
SEQ ID NO:9 - Rice-3 AA sequence 208-288 from Figure 19.
SEQ ID NO:10 - Rice-4 AA sequence 33-155 from Figure 19.
SEQ ID NO:11 - Rice-4 AA sequence 215-271 from Figure 19.



DEPOSITS 

The following vector molecules have been deposited with American Type Culture
Collection, 12301 Parklawn Drive, Rockville, MD 20852, U.S.A. on the dates indicated below:

Plasmid BAC-04 was deposited with ATCC on May 8, 1996 as ATCC 97543.
Plasmid P1-18 was deposited with ATCC on June 13, 1996 as ATCC 97606.
Cosmid D7 was deposited with ATCC on September 25, 1996 as ATCC 97736.

EXAMPLES

Example 1

Identifying NIM1 clones by map-based cloning. High resolution genetic mapping
and physical mapping of NIM1 in Arabidopsis.

1. Plant Material and Isolation of nim1 Mutants.

nim1 mutants were isolated from two Arabidopsis ecotype Ws-0 plant populations
as described by Delaney et al., 1995, PNAS 92, 6602-6606. One mutant population was in
the form of an M2 library derived from ethyl methane sulfonate (EMS) mutagenized seeds
(purchased from Lehle, Round Rock, TX), and the other was in the form of a T-DNA
population derived from seed obtained from the Ohio State University Arabidopsis Biological
Resource Center (Columbus, OH).

The basis of the screen for noninducible immunity (nim1) mutants was to screen
mutagenized plant populations for plants in which resistance to a virulent pathogen could
not be induced by INA (2,6-dichloroisonicotinic acid; Metraux et al., 1991, In: Advances in
Molecular Genetics of Plant-Microbe Interactions, Vol. 1, 432-439, Hennecke and Verma,
eds.; Kessmann et al., 1993, In: Mode of action of agrochemicals, Y. Honma, ed.; Vernooij et
al., 1995, Molec. Pl. Microbe Interaction 8, 228-234).

Plants from the mutant populations were grown at high density in large trays in 
commercial planting mix. When the plants were 2 weeks of age, the trays were sprayed 
with 0.25 mg/ml INA. Four days later, the plants were sprayed with a spore suspension of 
Peronospora parasitica, isolate EmWa (EmWa), at 5x10^4 to 1x10^5 spores/ml. This fungus
is normally virulent on the Arabidopsis Ws-0 ecotype, unless resistance is first induced in 
these plants with INA or a similar compound. 

Following incubation in a high humidity environment, plants with visible disease 
symptoms were identified, typically 7 days after the infection. These plants did not show 
resistance to the fungus, despite the application of the resistance-inducing chemical and 
were thus potential nim (non-immunity) mutant plants. From 360,000 plants, 75 potential 
nim mutants were identified. 

These potential mutant plants were isolated from the flat, placed under low humidity
conditions and allowed to set seed. Plants derived from this seed were screened in an
identical manner for susceptibility to the fungus EmWa, again after pretreatment with INA.
The progeny plants that showed infection symptoms were defined as nim mutants. Six nim
lines were thus identified. One line (nim1) was isolated from the T-DNA population and five
from the EMS population.

2. Scoring Plant Reactions to INA and Other Chemical Inducers of Disease
Resistance.

i. Phenotypic analysis of nim1.

Salicylic acid (SA) and benzo(1,2,3)thiadiazole-7-carbothioic acid S-methyl ester
(BTH) are chemicals that, like INA, induce broad spectrum disease resistance, termed
Systemic Acquired Resistance (SAR), in wildtype plants. Since INA did not induce
resistance in the nim1 plants, these plants were also evaluated for their disease resistance
response following pretreatment with SA and BTH (as partly described in Delaney et al.,
1995, PNAS 92, 6602-6606).

Plants were sprayed with 1, 5, or 15 mM SA or 0.25 mg/ml BTH and challenge
inoculated with EmWa 5 days later (as described in example 1 above). Both SA and BTH
failed to protect nim1 plants from fungal infection, as evidenced by the presence of disease
symptoms and fungal growth on these plants. Thus, the nim1 plants were not responsive to
any of the SAR-inducing chemicals, implying that the mutation was downstream of the entry
point(s) for these chemicals in the resistance induction pathway.

nim1 was also evaluated for its disease response to infection with two incompatible P.
parasitica isolates, Wela and Noco (i.e. these fungal strains do not cause disease on
wildtype Ws-0 plants). nim1 plants were sprayed with conidial suspensions of 5-10x10^4
spores/ml of Wela or Noco and incubated under high humidity for 7 days. Unlike wildtype
plants, nim1 plants developed disease symptoms in response to both Wela and Noco
infection. The symptoms were necrotic flecking and trailing, with some sporulation.
Following lactophenol blue staining, fungal hyphae were easily observed in the leaves of
nim1 plants. Thus, the nim1 plants are susceptible to normally incompatible P. parasitica
isolates. This result shows that the nim1 plants are not only defective in chemically induced
disease resistance, but are also defective in natural resistance to microorganisms that are
normally not pathogenic.

ii. Biochemical analysis of nim1.

SA, INA and BTH induce SAR and expression of the SAR genes, which include the
Pathogenesis Related genes PR-1, PR-2 and PR-5 in Arabidopsis. Since these
compounds did not induce disease resistance in nim1 (as described in example 1.2 above),
this mutant line was analyzed for SAR gene expression following SA, INA or BTH treatment.

After treatment of nim1 plants with SA, INA or BTH, plant tissue was harvested and
analyzed for accumulation of RNA from the PR-1, PR-2 and PR-5 genes. To this end, total
RNA was isolated from the treated tissues and electrophoresed on an agarose gel.
Triplicate gel blots were prepared and each was hybridized with a probe for one of these 3
SAR genes as described in Delaney et al., 1995, PNAS 92, 6602-6606. In contrast to the
case in wildtype plants, the chemicals did not induce RNA accumulation from any of these 3
SAR genes in nim1 plants, as shown in figure 1. Taken together, the results indicate that
the chemicals induce neither SAR nor SAR gene expression in nim1 plants.

Since the chemicals did not induce SAR or SAR gene expression in nim1 plants, it
was of interest to investigate whether pathogen infection could induce SAR gene
expression in these plants, as it does in wildtype plants. Ws-0 and nim1 plants were
sprayed with EmWa spores as described and tissue collected for RNA analysis at several
timepoints. Pathogen infection (EmWa) of wildtype Ws-0 plants induced PR-1 gene
expression within 4 days after infection, as shown in figure 2. In nim1 plants, however,
PR-1 gene expression is not induced until 6 days after infection and the level is reduced
relative to the wildtype at that time. Thus, following pathogen infection, PR-1 gene
expression in nim1 plants is delayed and reduced relative to the wildtype.

Infection of wildtype plants with pathogens that cause a necrotic reaction leads to
accumulation of SA in the infected tissues. It has been shown that this endogenous SA is
required for signal transduction in the SAR pathway, i.e. breakdown of the endogenous SA
leads to a decrease in disease resistance. This defines SA accumulation as a marker in the
SAR pathway (Gaffney et al., 1993, Science 261, 754-756).

nim1 plants were tested for their ability to accumulate SA following pathogen
infection. Pseudomonas syringae tomato strain DC 3000, carrying the avrRpt2 gene, was
injected into leaves of 4-week-old nim1 plants. The leaves were harvested 2 days later for
SA analysis as described by Delaney et al., 1995, PNAS 92, 6602-6606. This analysis
showed that the nim1 plants accumulated high levels of SA in infected leaves, as shown in
Figure 3. Uninfected leaves also accumulated SA, but not to the same levels as the
infected leaves, similar to what has been observed in wild-type Arabidopsis. This indicated
that the nim mutation maps downstream of the SA marker in the signal transduction
pathway. This was anticipated, as INA and BTH (inactive in nim1 plants) are known to
stimulate a component in the SAR pathway downstream of SA (Vernooij et al., 1995, Molec.
Pl. Microbe Interaction 8, 228-234; Friedrich et al., 1996, The Plant Journal 9, in press; and
Lawton et al., 1996, The Plant Journal 2, in press). In addition, as described in Example
1.2, exogenously applied SA did not protect nim1 from EmWa infection.

3. Genetic analysis of nim1.

nim1 plants were backcrossed to wild-type Ws-0 plants, and F1 progeny were
tested for resistance to EmWa after INA pretreatment, as described in Example 1.1 above.
None of the INA-pretreated F1 plants had infection symptoms, whereas the nim1 control
plants did show infection. Hence, the nim1 mutation was determined to be recessive.

The F2 population from the Ws-0 x nim1 cross was also assayed for its disease
resistance after INA pretreatment. Of this population, approximately 1/4 (32/130 plants)
showed disease symptoms after EmWa treatment of INA pre-sprayed plants and 3/4
(98/130 plants) showed no disease. These results indicate that the nim mutation identifies
a single genetic locus and corroborate the F1 data that show the recessive nature of the
mutation.
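
The 3:1 reading of the F2 counts above can be checked with a chi-square goodness-of-fit
test. The following is a minimal sketch, not part of the original analysis; the function
name and the use of the 5% critical value for one degree of freedom (3.841) are
assumptions made for illustration.

```python
# Illustrative check (assumption: a chi-square goodness-of-fit test is an
# appropriate way to compare the F2 counts with a 3:1 Mendelian ratio).

def chi_square_3_to_1(resistant, susceptible):
    """Chi-square statistic for observed counts against a 3:1 expectation."""
    total = resistant + susceptible
    observed = (resistant, susceptible)
    expected = (0.75 * total, 0.25 * total)
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

# Counts reported in the text: 98 plants without disease, 32 with symptoms.
chi2 = chi_square_3_to_1(98, 32)

# Standard 5% critical value for 1 degree of freedom.
CRITICAL_5_PERCENT_DF1 = 3.841
consistent_with_single_recessive_locus = chi2 < CRITICAL_5_PERCENT_DF1
print(f"chi-square = {chi2:.4f}, consistent with 3:1: "
      f"{consistent_with_single_recessive_locus}")
```

A statistic far below the critical value means the observed counts do not deviate
detectably from the ratio expected for a single recessive locus.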

4. Identification of markers in and genetic mapping of the NIM1 locus.

For conventional map-based cloning of the NIM1 gene, markers had to be identified
that were genetically closely linked to the mutation. This was accomplished in 2 steps. First,
the nim1 plants were crossed to a different Arabidopsis genotype, Landsberg erecta (Ler),
and F2 plants from this cross which had a nim1 phenotype (i.e. plants that are homozygous
nim1/nim1 at the NIM1 locus) were identified. From these, plants that had a Ler genotype at a
nearby DNA marker were identified by molecular analysis. These plants, by virtue of the
identification criterion, are recombinant between the marker and the NIM1 locus. The
frequency of recombinants defines the genetic distance between the marker and the NIM1
locus.

The second prerequisite for map based cloning is that markers are identified that are
genetically very close to the NIM1 locus, i.e. markers that identify very few recombinants. If
genetic markers are identified that are very close, then these can be used to isolate
genomic DNA clones that are close to the NIM1 locus. The NIM1 locus can then be cloned by
walking, if not already present on the cloned DNA. Walking can be initiated from both sides
of the gene. It relies on obtaining overlapping clones that are successively closer to the
gene of interest. When a single DNA marker is obtained from a walk initiated from, for
instance, the North end and it identifies no recombinants between this marker and the gene
of interest, it must be very close to the gene. However, if the marker does identify
recombinant(s) from the South end, the clone from which the marker was obtained must
have crossed the gene. By definition then, the gene of interest is cloned; it must be located
between this marker and the last North-end marker that identifies the least number of
recombinants from the North end.

In a first step, a large number of recombinants are generated by genetic crossing. In
a second step, recombinants that are close to the NIM1 gene are identified with the use of
molecular markers. Many markers have been described in the literature and several
methods exist to develop additional markers. Our approach has relied on a number of
marker systems, including SSLPs and AFLPs (see below).

i. Genetic crosses.

In order to map the chromosomal position of the NIM1 gene relative to the SSLP and
AFLP markers, nim1 was crossed to Ler to make a mapping population. F2 plants from this
cross were grown and leaves harvested for future DNA extractions. Next, the F2 plants
were scored for the nim1 phenotype, as described in example 1 above. Also, F3
populations derived from individual F2 plants were grown and scored for the nim1
phenotype. DNA was extracted from the stored tissue of nim1 phenotype F2 and F3 plants
by the CTAB method, as described (Rogers and Bendich, 1988, Plant Molecular Biology
Manual, A6, 1-10). This DNA was used for mapping the NIM1 gene, as described below.

ii. Simple Sequence Length Polymorphism markers. 

The Simple Sequence Length Polymorphism (SSLP) markers ATHGENEA and
nga111 have been described (Bell and Ecker, 1994, Genomics 19, 137-144). Primers used
for detection of these SSLPs are listed in Table 1.

Table 1. SSLP primer sequences.

Primer set        Primer sequence (5' to 3')

ATHGENEA (1)      ACC ATG CAT AGC TTA AAC TTC TTG
                  ACA TAA CCA CAA ATA GGG GTG C

ATHGENEA (2)      ACC ATG CAT AGC TTA AAC TTC TTG
                  CCA AAT GTC AAA ATA CTC GTC

nga111 (1)        CTC CAG TTG GAA GCT AAA GGG
                  TGT TTT TTA GGA CAA ATG GCG

nga111 (2)        CTC CAG TTG GAA GCT AAA G
                  TGT TTT TTA GGA CAA ATG G

Genetic mapping of the NIM1 gene relative to marker ATHGENEA.

Using the ATHGENEA (1) primers for PCR amplification of Ler genomic DNA, a 205-
basepair (bp) band was expected, whereas with Ws-0 genomic DNA a band of 211 bp was
expected (Bell and Ecker, 1994, Genomics 19, 137-144). The amplification products proved
to be difficult to separate on conventional agarose gels. Hence, two alternative methods
were developed for separation and detection of these PCR fragments.

In a first method, primer set ATHGENEA (1) (Table 1) was used to amplify genomic
DNA in the presence of 6-carboxyrhodamine-labelled dUTP (dUTP-R110, obtained from
ABI), yielding rhodamine-labelled PCR fragments. The PCR reactions were analyzed on a
DNA Sequencer, which detects DNA fragments with single nucleotide resolution.

The specific reagents were: 1xPCR buffer, 2 mM MgCl2, dNTPs each 200 µM, 2
µM dUTP-R110, ATHGENEA (1) primers at 0.75 µM, 10 ng DNA and 0.75 units Taq
polymerase in a 20 µl reaction volume. Amplification conditions were: 3 minutes at 94°C
followed by 35 cycles of 15 seconds at 94°C, 15 seconds at 55°C and 30 seconds at 72°C.
These samples were analyzed on an ABI 377 DNA Sequencer, capable of detecting
fluorescently labeled DNA fragments with single nucleotide (nt.) resolution. This allowed for
genotyping the plant samples: a 205-nucleotide DNA fragment was obtained from Ler DNA
and a 211-nucleotide band from Ws-0 DNA. Thus, DNA fragments differing by 6
nucleotides in length could be easily distinguished, allowing for easy genotyping of samples
as homozygous Ws-0, homozygous Ler and heterozygous Ws-0/Ler at the ATHGENEA
locus.



In order to increase the throughput of this system, a multiplexing scheme was used.
Some DNA samples were PCR amplified as described above with primer set ATHGENEA
(1), whereas other samples were analysed with primer set ATHGENEA (2) (listed in Table 1),
in each case in the presence of 6-carboxyrhodamine-labeled dUTP. Primer set ATHGENEA
(2) was made based on the published sequence of ATHGENEA (Simoens et al., 1988,
Gene 67, 1-11). This primer set amplified a DNA fragment of 139 bp from Ler DNA and a
145-bp band from Ws-0 DNA. Amplification reaction conditions for primer set ATHGENEA
(2) were identical to those described for primer set ATHGENEA (1), above.

Single reactions using primer set ATHGENEA (1) and single reactions using primer
set ATHGENEA (2) were mixed together before electrophoresis on the ABI 377 DNA
Sequencer. This multiplexing approach allowed for genotyping 2 samples in a single lane of
the Sequencer, one at positions 145/139 nt. and one at positions 211/205 nt. on the
Sequencer.
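
The genotype calls for a multiplexed lane can be sketched as a simple lookup of
expected fragment sizes. This is an illustration only: the function name, the dictionary
layout and the 1-nt matching tolerance are hypothetical and are not taken from the
procedure described above; only the fragment sizes (211/205 nt and 145/139 nt) come
from the text.

```python
# Expected fragment sizes in nucleotides, as (Ws-0 allele, Ler allele).
ALLELE_SIZES = {
    "ATHGENEA (1)": (211, 205),
    "ATHGENEA (2)": (145, 139),
}

def call_lane(peaks, tolerance=1):
    """Assign a genotype per primer set from the fragment sizes seen in one lane.

    `peaks` is a list of detected fragment sizes (nt). `tolerance` is a
    hypothetical sizing tolerance chosen for this sketch.
    """
    calls = {}
    for primer_set, (ws_size, ler_size) in ALLELE_SIZES.items():
        has_ws = any(abs(p - ws_size) <= tolerance for p in peaks)
        has_ler = any(abs(p - ler_size) <= tolerance for p in peaks)
        if has_ws and has_ler:
            calls[primer_set] = "heterozygous Ws-0/Ler"
        elif has_ws:
            calls[primer_set] = "homozygous Ws-0"
        elif has_ler:
            calls[primer_set] = "homozygous Ler"
        else:
            calls[primer_set] = "no call"
    return calls

# One lane holding two mixed reactions: sample A typed with primer set (1),
# sample B typed with primer set (2).
lane_calls = call_lane([211, 145, 139])
print(lane_calls)
```

Because the two primer sets give non-overlapping size windows, the two samples mixed in
one lane can be scored independently.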

In the second method, PCR fragments were labelled by using a primer labelled with
the fluorescent dye FAM-6 (6-carboxyfluorescein) (Integrated DNA Technologies, Inc.). The
forward ATHGENEA primers of the ATHGENEA (1) and (2) primer sets are identical in
sequence (see Table 1). This primer was labeled with FAM-6 and used in a PCR
amplification reaction with the following reagents (Perkin Elmer): 1xXL buffer, 1 mM MgCl2,
dNTPs each at 200 µM, primers each at 0.50 µM (forward primer FAM-6 labeled), 10 ng
genomic DNA and 0.5 units XL polymerase in a 20-µl reaction volume. The cycling
conditions were: 3 minutes at 94°C, followed by 35 cycles of 15 seconds at 94°C, 15
seconds at 59°C and 30 seconds at 72°C. Again, single reactions using primer set
ATHGENEA (1) and single reactions using primer set ATHGENEA (2) were mixed together
before electrophoresis on the ABI 377 DNA Sequencer. This multiplexing approach
allowed for genotyping 2 samples in a single lane of the Sequencer, one at positions
145/139 nt. and one at positions 211/205 nt.

All F2 and F3 samples from nim1 phenotype plants were scored for their genotype
at the ATHGENEA locus as described above. All samples that were heterozygous at this
locus identified plants that were recombinant between the NIM1 locus and the ATHGENEA
locus. In a population of 1144 F2 nim1 phenotype plants and F3 nim1 phenotype
populations that were scored in this way, 98 were heterozygous at the ATHGENEA locus,
giving an estimate of the genetic distance between this SSLP locus and the NIM1 locus of
4.3 cM. This established that the NIM1 locus was on chromosome 1, near the ATHGENEA
marker.
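
The distance estimate can be reproduced with a short calculation. The sketch below
rests on two assumptions that are not stated explicitly in the text: each heterozygous
plant is counted as carrying one recombinant chromosome out of two, and over short
intervals the map distance in cM is taken as the recombination frequency times 100.
The function name is illustrative. The second call uses the nga111 count (239
heterozygotes among the same 1144 plants) reported in the section that follows.

```python
def map_distance_cm(heterozygous_plants, plants_scored):
    """Approximate map distance (cM) from heterozygotes among nim1/nim1 plants.

    Assumes one recombinant chromosome per heterozygous plant and two
    chromosomes scored per plant.
    """
    recombinant_chromosomes = heterozygous_plants
    chromosomes_scored = 2 * plants_scored
    return 100.0 * recombinant_chromosomes / chromosomes_scored

athgenea_cm = map_distance_cm(98, 1144)   # ATHGENEA
nga111_cm = map_distance_cm(239, 1144)    # nga111
print(f"ATHGENEA: {athgenea_cm:.1f} cM, nga111: {nga111_cm:.1f} cM")
```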

Genetic mapping of the NIM1 gene relative to marker nga111.
Two primer sets for SSLP marker nga111 (described in Bell and Ecker, 1994,
Genomics 19, 137-144) were used to amplify genomic DNA of F2 and F3 nim1 phenotype
plants and control Ws-0 and Ler plants. Primer set nga111 (1) (described in Bell and
Ecker, 1994, Genomics 19, 137-144 and listed in Table 1) was used under the following
conditions: [...] ng DNA and 0.75 unit Taq polymerase in a 20 µl reaction volume. Primer
set nga111 (2) (listed in Table 1), a derivative of primer set nga111 (1), was used under
different conditions: 1xPCR buffer, 1.5 mM MgCl2, dNTPs each 200 µM, primers at 1 µM,
10 ng DNA and 1 unit Taq polymerase in a 20 µl reaction volume. Both reactions were
amplified by incubation at 94°C for 1 minute, followed by 40 cycles of 15 seconds at 94°C,
15 seconds at 55°C and 30 seconds at 72°C.



The samples were analyzed on 3-5% agarose gels. The band obtained from
amplification of Ws-0 DNA with either primer set was 146 bp, whereas amplifying Ler DNA
resulted in a 162-bp band. Plant samples that were heterozygous at the nga111 locus
identified plants that were recombinant between this SSLP marker and the NIM1 locus.
Among 1144 F2 nim1 phenotype plants and F3 nim1 phenotype populations, 239 were
identified as heterozygous for the nga111 marker, giving an estimate for the genetic
distance between the SSLP marker and the NIM1 locus of 10.4 cM. This corroborated that
the NIM1 locus was on chromosome 1. Since few nim1 phenotype plants existed that were
heterozygous at both ATHGENEA and nga111, the NIM1 locus was determined to be
between these 2 markers, with ATHGENEA located north of the NIM1 gene and nga111
located south of the NIM1 gene. This placed the NIM1 gene approximately 10 cM north of
nga111, near position 85 on chromosome 1 (Lister and Dean, 1993, Plant J. 4, 745-750;
Bell and Ecker, 1994, Genomics 19, 137-144).

iii. Amplified Fragment Length Polymorphism markers.

For map-based cloning of the NIM1 gene, it is necessary to identify molecular
markers that are successively closer to this gene. For this purpose, Amplified Fragment
Length Polymorphism (AFLP) markers were generated by using the selective restriction
fragment amplification method described by Zabeau and Vos (1993, European Patent
Application EP 0534858) and Vos et al. (1995, Nucleic Acids Research 23, 4407-4414).

Outline of the AFLP Technology. 

The use of the AFLP technology in mapping relies on selective amplification of a set 
of DNA bands in 2 genetically distinct samples. Finding that any of the obtained bands are 
different between the 2 genotypes identifies those bands as markers for that genotype. If
the marker cosegregates at high frequency with the gene (mutation) of interest, then the 
marker is close to the genetic locus. 

Selective amplification of a small set of DNA fragments in a complex DNA sample is 
achieved in a 2-step process. First, DNA fragments are generated by digesting the DNA 
with restriction enzymes, followed by ligation of adapters to the ends. Second, primers 
consisting of a sequence complementary to the adapters plus a 3' extension (typically 0-3 
nucleotides) are used to amplify only those DNA fragments with ends that are 
complementary to these primers. If a single nucleotide extension is used, then theoretically, 
each primer will "fit" on approximately 1/4 of all fragments, with 1/16 of all fragments having
a primer fit on both ends. Thus, a limited set of DNA fragments is amplified with these 
primers. By further radiolabelling one primer, an even smaller subset of visible bands can 
be obtained. 
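
The effect of the selective 3' extensions can be expressed as a small calculation. This
sketch assumes that all four nucleotides are equally likely at each selective position,
which is the same idealization behind the "theoretically 1/4" figure above; the
function name is illustrative.

```python
from fractions import Fraction

def fraction_amplified(extension_len_primer_1, extension_len_primer_2):
    """Theoretical fraction of fragments matched by both selective primers.

    Assumes equal base frequencies, so each selective nucleotide retains
    1/4 of the fragments it is tested against.
    """
    total_selective_bases = extension_len_primer_1 + extension_len_primer_2
    return Fraction(1, 4) ** total_selective_bases

# Single-nucleotide extension on each primer: 1/16 of all fragments.
print(fraction_amplified(1, 1))
# A 2-nucleotide plus a 3-nucleotide extension: 1/1024 of all fragments.
print(fraction_amplified(2, 3))
```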

AFLP analysis.

For AFLP analysis of DNA samples, 50 ng DNA was digested with the appropriate
enzymes (usually EcoRI and MseI; see below) and adapters (listed in Table 2 below) were
ligated to the restriction fragments (usually EcoRI and MseI). The sequences of the primers
and the YAC, P1 and BAC clones are described in detail below. The templates were used
for amplification reactions (approximately 0.5 ng DNA per reaction), using primers that were
complementary to the adapters, with short extensions (2 or 3 nucleotides; primer
sequences are listed below). Since one of the primers is radioactively labelled (usually the
EcoRI primer), only a subset of the amplified fragments is visible upon autoradiographic
analysis of the gel used to separate the bands.

Amplification conditions for cloned DNA (YAC, P1, cosmid) were as follows: 36
cycles of 30 sec. at 94°C (denaturation), 30 sec annealing and 1 min extension at 72°C.
The annealing temperature in the first cycle was 65°C and was reduced by 0.7°C in each
cycle for the next 12 cycles and then kept at 56°C. For genomic DNA of Arabidopsis plants,
the amplification was performed in 2 steps: in the first step (preamplification), the DNA was
amplified with primers that have a single nucleotide extension (neither primer was labeled).
Reaction conditions for this amplification reaction were: 20 cycles of 30 sec. denaturation
(94°C), 1 min annealing (56°C) and 1 min extension (72°C). In the second step, the first
amplification reaction was diluted 10 fold and reamplified 36 cycles with primers containing
the full-size extensions (using one labeled primer) under the following conditions: 30 sec. at
94°C (denaturation), 30 sec annealing and 1 min extension at 72°C. The annealing
temperature in the first cycle was 65°C and was reduced by 0.7°C in each cycle for the next
12 cycles and then kept at 56°C. The final reaction products were separated on a
polyacrylamide gel and the gel was exposed to film, allowing visualization of the
radiolabeled PCR bands. When this procedure was applied to DNA from 2 genotypes
simultaneously, AFLP bands were identified that were diagnostic for one genotype or the
other. Such bands are called informative AFLP bands, or AFLP markers. Table 2 shows
adapters used in the AFLP analysis.
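
The touchdown annealing profile can be written out cycle by cycle. The sketch below
assumes a 36-cycle program starting at 65°C, a decrement of 0.7°C per cycle over the
next 12 cycles, and a plateau of 56°C for the remaining cycles; the function and
parameter names are hypothetical and do not correspond to any thermocycler software.

```python
def touchdown_annealing(total_cycles=36, start=65.0, step=0.7,
                        touchdown_cycles=12, plateau=56.0):
    """Return the annealing temperature (degrees C) for each cycle.

    Cycle 1 anneals at `start`; each of the next `touchdown_cycles` cycles
    is `step` degrees lower than the previous one; all later cycles anneal
    at `plateau`.
    """
    temperatures = []
    for cycle_index in range(total_cycles):
        if cycle_index <= touchdown_cycles:
            temperatures.append(round(start - step * cycle_index, 1))
        else:
            temperatures.append(plateau)
    return temperatures

profile = touchdown_annealing()
for cycle_number, temperature in enumerate(profile, start=1):
    print(f"cycle {cycle_number:2d}: anneal at {temperature:.1f} C")
```

Under these assumptions the last touchdown cycle anneals at 56.6°C, after which the
program holds 56°C for the remaining 23 cycles.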

Table 2

Enzyme      Adapter

EcoRI       5'-CTCGTAGACTGCGTACC-3'
            3'-CATCTGACGCATGGTTAA-5'

HindIII     5'-CTCGTAGACTGCGTACC-3'
            3'-CATCTGACGCATGGTCGA-5'

PstI        5'-CTCGTAGACTGCGTACATGCA-3'
            3'-CATCTGACGCATGT-5'

MseI        5'-GACGATGAGTCCTGAG-3'
            3'-TACTCAGGACTCAT-5'



Generation of AFLP markers and fine mapping of the NIM1 locus.
A population of recombinant inbred lines derived from a cross between the Arabidopsis
ecotypes Landsberg erecta (Ler) and Columbia (Col) (Lister and Dean, 1993, Plant J. 4,
745-750) was used for AFLP marker screening. The primers used for the AFLP screening
were:

EcoRI-primers: 5'-GACTGCGTACCAATTCWN-3'
MseI-primers: 5'-GATGAGTCCTGAGTAAXWN-3'

An "N" in the primers indicates that this part was variable (A, C, G or T), a "W"
indicates A or T, and an "X" indicates a C. All 8 possible primers were used for both the
EcoRI- and MseI-primer. This gave a total of 64 (8 x 8) primer combinations (PCs) that
were used to amplify DNA from the recombinant inbred line and the parental genotypes, Ler
and Col, as described above. The amplification reactions were run on a denaturing
polyacrylamide gel to separate AFLP fragments by size and the gel was exposed to film.
The film was inspected for bands that were present in only one genotype, i.e. inspected for
AFLP markers.
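
The count of 8 primers per enzyme and 64 primer combinations follows from expanding
the degenerate positions. The sketch below enumerates them; the helper name `expand`
and the code table are illustrative, and "X" is treated as C as defined in the text
(it is not a standard degenerate-base symbol).

```python
from itertools import product

# Degenerate codes as defined in the text: N = A/C/G/T, W = A/T, X = C.
CODES = {"N": "ACGT", "W": "AT", "X": "C",
         "A": "A", "C": "C", "G": "G", "T": "T"}

def expand(degenerate_sequence):
    """List every concrete sequence encoded by a degenerate primer."""
    choices = [CODES[base] for base in degenerate_sequence]
    return ["".join(combination) for combination in product(*choices)]

eco_primers = expand("GACTGCGTACCAATTCWN")   # EcoRI-primers
mse_primers = expand("GATGAGTCCTGAGTAAXWN")  # MseI-primers
primer_combinations = list(product(eco_primers, mse_primers))

print(len(eco_primers), len(mse_primers), len(primer_combinations))
```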

The AFLP markers, i.e., DNA fragments that are polymorphic between both parents 
of the recombinant inbred lines, were used for constructing a genetic map of the 
recombinant inbred line population. Example 1.5i below describes the mapping of the NIM1
gene on Arabidopsis chromosome 1, at approximately position 85. Those AFLP markers
that had been mapped (using the recombinant inbred line) between positions 81 and 88 of
Arabidopsis chromosome 1 were chosen for analyzing recombinant plants for the presence
of said AFLP markers and thus for mapping the NIM1 gene more precisely. Seven AFLP
markers from this region were identified as being informative; they were polymorphic
between both parents of the nim1 x Ler cross. Six AFLP markers were Ler-specific, i.e.
these AFLP markers were absent in Ws (and in Col as well). One AFLP marker was Ws-
specific, i.e. a Col-specific AFLP marker (absent in Ler) was also present in Ws. These
AFLP markers are: L81.1, L81.2, W83.1, L84, L85, L87 and L88 (an L-marker is specific for
ecotype Ler and a W-marker is specific for both the ecotypes Col and Ws; the number
indicates the map position). These AFLP markers were used to analyze recombinant plants
from the nim1 x Ler cross (see below). In addition, AFLP marker C86 (a recombinant, inbred-
line-derived marker specific for Col) was used in isolating DNA clones (see below). Table 3
lists the primer sequences that were used to obtain these AFLP markers.

Table 3 shows primer combinations of AFLP markers derived from the recombinant
inbred line population.

"EcoRI-" refers to the sequence 5'-GACTGCGTACCAATTC-3' and
"MseI-" refers to the sequence 5'-GATGAGTCCTGAGTAA-3'.

Table 3

AFLP marker    Corresponding primer combinations

L81.1          EcoRI-CA     MseI-CCG
L81.2          EcoRI-AA     MseI-CAA
W83.1          EcoRI-CA     MseI-CTC
L84            EcoRI-AAT    MseI-CAA
L85            EcoRI-CA     MseI-CCT
L87            EcoRI-CA     MseI-CTT
L88            EcoRI-AG     MseI-CTA
C86            EcoRI-AG     MseI-CCT


A detailed genetic map of the region was constructed using the AFLP markers
described above by typing the recombinants. A total of 337 recombinant plants were
available out of 1144 F2 nim1 plants. These recombinants were first screened with the
North-flanking AFLP markers L81.2 and ATHGENEA and the South-flanking markers L88
and nga111. Forty-eight plants were homozygous nim1/nim1 and heterozygous at
ATHGENEA and L81.2, and 21 plants were homozygous nim1/nim1 and heterozygous at
nga111 and L88. These recombinant plants were further analyzed with 9 AFLP markers in
the NIM1 region, including 4 AFLP markers that were derived from the recombinant inbred
line mapping population (W83.1, L84, L85 and L87) and 5 AFLP markers derived from
analysis of YAC clones (W83.3/W84.1, W84.2, W85.1, W86.1 and L86, see below).

The genetic map of NIM1, based on this analysis, is depicted in Figure 4. As seen,
27 recombinants were found between marker W84.2 and NIM1 and 14 recombinants were
found between W85.1 and NIM1. Marker L85 is linked closely to NIM1, but this marker
could not be mapped on the YAC, BAC or P1 clones (see below) and was, therefore, not
useable for identification of the NIM1 gene.

5. Physical mapping of the NIM1 region.

i. Isolation of YAC clones using AFLP markers closely linked to NIM1.

The CIC library, an Arabidopsis ecotype Columbia YAC library (Bouchez et al., 1995,
6th Int. Conf. on Arabidopsis Research, Madison, WI), was screened for YAC clones in the
NIM1 region. This library has about 2.5 nuclear genome equivalents and has an average
insert size of 450 kb. The YAC library was screened with two AFLP markers: W83.1 and
C86. W83.1 is the most closely linked recombinant, inbred-line-derived AFLP marker north
of NIM1, and C86 is a recombinant, inbred-line-derived AFLP marker specific for Col
(absent in Ler and Ws). C86 mapped south of the NIM1 gene on the map of the
recombinant inbred line population. This Col AFLP marker has been used instead of the
closely linked Ler AFLP markers (Figure 4), because the latter AFLP markers detected only
ecotype Landsberg erecta and hence cannot be used for screening the Columbia YAC
library.

The YAC library was screened in two steps. Firstly, the cells of the YAC clones of
each plate of the twelve 96-well microtiter plates were pooled (a plate pool) and used for
DNA isolation as described by Ross et al. (1991, Nucleic Acids Res. 19, 6053). The pools
were screened with both AFLP markers. Subsequently, from each positive plate pool, the
DNA samples of each row (a pool of 8 clones) and of each column (a pool of 12 clones)
were screened with the AFLP marker for which the plate pool was positive. In this way, the
individual positive YAC clones could be identified. The screening yielded a total of 4 YAC
clones: YAC 12F04 and YAC 12H07 were isolated using the north AFLP marker W83.1,
and YAC 10G07 and YAC 7E03 using the south AFLP marker C86 (for the nomenclature of
the YAC clones the CIC numbering is used). The YACs were "fingerprinted" by AFLP, giving
YAC-specific AFLP fragments. Fingerprints of the YACs were compared and used to
estimate overlaps between the YACs (see also Tables 5 and 6). Based on the AFLP
fingerprints, clone 7E03 is essentially covered by clone 10G07 (see also table 5) and clone
12H07 is likewise essentially covered by clone 12F04 (see also table 6).
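
The two-step pool screen can be sketched as an intersection of positive row and column
pools within a positive plate. This is an illustration under stated assumptions: the
function name is hypothetical, and the clone naming (plate number, row letter, two-digit
column) is inferred from names such as 12F04 rather than taken from a description of
the CIC numbering.

```python
from itertools import product

def candidate_clones(plate, positive_rows, positive_columns):
    """Clones on one plate that lie in a positive row AND a positive column.

    With one positive row pool and one positive column pool the clone is
    identified uniquely; with several positives per plate the result is the
    set of candidates that still needs individual testing.
    """
    return [f"{plate}{row}{column:02d}"
            for row, column in product(sorted(positive_rows),
                                       sorted(positive_columns))]

# One positive row pool and one positive column pool on plate 12.
single_hit = candidate_clones(12, {"F"}, {4})
# Two positive rows and two positive columns leave four candidates.
ambiguous_hits = candidate_clones(12, {"F", "H"}, {4, 7})
print(single_hit, ambiguous_hits)
```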

ii. Generation of AFLP markers from YAC clones.

Since the AFLP markers described above were genetically relatively far from the
NIM1 gene (see figure 3), additional AFLP markers were developed in an effort to find
markers that were closer to the NIM1 gene.

A screening for additional YAC-derived AFLP markers was performed on DNA
samples of the following: DNA of the isolated YAC clones (4 YACs were identified, as
described above), the yeast strain without a YAC, and the three Arabidopsis ecotypes Col,
Ler and Ws. In this way, the fragments specific for the YAC clones (absent in the yeast
strain and present in Col) could be tested for polymorphism in Ler and Ws (the parents of
the recombinant plants identified in Example 1.5 below). All identified polymorphic
fragments would thus be additional AFLP markers. In the first AFLP screening the enzyme
combination (EC) EcoRI/MseI was used. In this screening two YAC clones, 10G07 and
7E03 (detected with AFLP marker C86, see below), the yeast strain without a YAC and the
three Arabidopsis ecotypes Col, Ler and Ws were assayed. The primer combinations with
the selective extensions used can be divided into three groups and are depicted in Table 4.
A total of 256 (64 + 96 + 96) primer combinations were screened.

In Table 4 below the primer sequences used In the AFLP screening of two YAC 
clones. 10G07 and 7E03. the yeast strain without a YAC. and the three Arabidopsis 
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ecotypes Col, Ler and Ws are shown. Three groups of primer combinations have been 
used. An *N" in the primers indicates that this part was variable (A, C, G or T), an "S" 
indicates C or G, a "W" indicates A or T, and a "Y" indicates C or T. 

Table 4 

Group 1 

EcoRI-primers: 
5'-GACTGCGTACCAATTCGW-3' 
5'-GACTGCGTACCAATTCTS-3' 

MseI-primers: 
5'-GATGAGTCCTGAGTAAAAS-3' 
5'-GATGAGTCCTGAGTAAASA-3' 
5'-GATGAGTCCTGAGTAAATN-3' 
5'-GATGAGTCCTGAGTAACAN-3' 
5'-GATGAGTCCTGAGTAACTN-3' 

Group 2 

EcoRI-primers: 
5'-GACTGCGTACCAATTCAN-3' 
5'-GACTGCGTACCAATTCCW-3' 
5'-GACTGCGTACCAATTCTW-3' 

MseI-primers: 
5'-GATGAGTCCTGAGTAAAAS-3' 
5'-GATGAGTCCTGAGTAAASA-3' 
5'-GATGAGTCCTGAGTAAGAY-3' 
5'-GATGAGTCCTGAGTAAGTW-3' 
5'-GATGAGTCCTGAGTAATCG-3' 
5'-GATGAGTCCTGAGTAATCT-3' 
5'-GATGAGTCCTGAGTAATGW-3' 

Group 3 

EcoRI-primers: 
5'-GACTGCGTACCAATTCGW-3' 
5'-GACTGCGTACCAATTCTN-3' 

MseI-primers: 
5'-GATGAGTCCTGAGTAAGAW-3' 
5'-GATGAGTCCTGAGTAAGCW-3' 
5'-GATGAGTCCTGAGTAAGTW-3' 
5'-GATGAGTCCTGAGTAATAN-3' 
5'-GATGAGTCCTGAGTAATCW-3' 
5'-GATGAGTCCTGAGTAATGW-3' 
5'-GATGAGTCCTGAGTAATTS-3' 
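
The count of 256 (64 + 96 + 96) primer combinations can be reproduced by expanding the degenerate positions of the selective extensions. The sketch below is only an illustrative check: the extensions are transcribed from the three primer groups of Table 4 (the bases following the common EcoRI and MseI core sequences), the last MseI extension of the third group is read as TTS, which is an assumption about a poorly legible entry, and the function names are hypothetical.

```python
from itertools import product

# Degenerate codes as defined in the text: N = A, C, G or T; S = C or G;
# W = A or T; Y = C or T.
CODES = {"A": "A", "C": "C", "G": "G", "T": "T",
         "N": "ACGT", "S": "CG", "W": "AT", "Y": "CT"}


def expand(extension):
    """List every concrete selective extension encoded by a degenerate one."""
    return ["".join(bases) for bases in product(*(CODES[b] for b in extension))]


def count_combinations(ecori_extensions, msei_extensions):
    """Number of primer pairs: concrete EcoRI primers times concrete MseI primers."""
    n_ecori = sum(len(expand(e)) for e in ecori_extensions)
    n_msei = sum(len(expand(m)) for m in msei_extensions)
    return n_ecori * n_msei


# Selective extensions of the three groups (EcoRI list, MseI list).
GROUPS = [
    (["GW", "TS"],
     ["AAS", "ASA", "ATN", "CAN", "CTN"]),
    (["AN", "CW", "TW"],
     ["AAS", "ASA", "GAY", "GTW", "TCG", "TCT", "TGW"]),
    (["GW", "TN"],
     ["GAW", "GCW", "GTW", "TAN", "TCW", "TGW", "TTS"]),  # "TTS" is an assumed reading
]

counts = [count_combinations(ecori, msei) for ecori, msei in GROUPS]
total = sum(counts)
```

The reading of the last extension does not affect the count, since any single fixed base in its second position yields the same number of primers.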



In total, 83 Col-specific fragments were generated, of which 62 were shared by both 
YAC clones. Three fragments were AFLP markers polymorphic between Ws and Ler, of 
which two were Ws AFLP markers (a Col fragment also present in Ws and absent in Ler) 
and one was a Ler AFLP marker (a Col fragment also present in Ler and absent in Ws). 
These results are presented in Table 5 below. 

Table 5 

            AFLP fragments in YAC clones    AFLP markers in Ws and Ler genotypes 
            10G07        7E03               Ws        Ler 
shared      62           62                 2         1 
unique      21           0                  0         0 

This AFLP analysis thus yielded 3 new AFLP markers (see Figure 4 and below). 
Their positions relative to each other and relative to the recombinant-inbred-line-derived 
AFLP markers were determined by analysis of the recombinants with these AFLP markers. 

A second screening for AFLP markers was performed assaying all four identified 
YAC clones (see below) and using the enzyme combination PstI/MseI. The primers used 
are: 

PstI-primers: 

5'-GACTGCGTACATGCAGAN-3' 
5'-GACTGCGTACATGCAGCW-3' 
5'-GACTGCGTACATGCAGGW-3' 
5'-GACTGCGTACATGCAGTN-3' 

MseI-primers: 

5'-GATGAGTCCTGAGTAAAN-3' 
5'-GATGAGTCCTGAGTAACW-3' 
5'-GATGAGTCCTGAGTAAGW-3' 
5'-GATGAGTCCTGAGTAATN-3' 

An "N" in the primers indicates that this part was variable (A, C, G or T) and a "W" in 
the primers indicates that this was A or T. A total of 144 (12 x 12) primer combinations was 
screened on all four isolated YAC clones, 12F04, 12H07, 10G07 and 7E03; the yeast strain 
without a YAC; and the three Arabidopsis ecotypes Col, Ler and Ws. In total, 219 AFLP 
fragments were generated, of which 144 were present in YAC clones 12F04 and 12H07 (72 
were unique for clone 12F04 and 72 were shared between both YACs) and of which 75 
were present in YAC clones 10G07 and 7E03 (33 were unique for clone 10G07 and 42 
were shared between the 2 YACs). Three fragments derived from the first set of YAC 
clones were polymorphic (Ws AFLP markers). These results are presented in Table 6 
below. 

Table 6 shows the number of AFLP fragments in YACs and the number of informative 
AFLP markers among these fragments in Ws and Ler genotypes. 

Table 6 

            number of AFLP fragments in YAC clones      AFLP markers 
            12F04     12H07     10G07     7E03          Ws      Ler 
shared      72        72        0         0             1       0 
unique      72        0         0         0             2       0 
shared      0         0         42        42            0       0 
unique      0         0         33        0             0       0 

The results indicate that YAC clone 12H07 is part of the larger YAC clone 12F04 
and that YAC clone 7E03 is a part of the larger YAC clone 10G07. These data indicate that 
the larger YAC clones, 12F04 and 10G07, do not overlap. These data do not allow the 
positioning of the NIM1 gene on any of these YAC clones. The whole screening, involving 
400 primer combinations producing 302 AFLP fragments in the NIM1 region, yielded 5 useful 
AFLP markers, of which 4 were Ws-specific and one Ler-specific. These 5 additional AFLP 
markers have been mapped by analysis of recombinant plants (see Figure 4 and below) and 
are denominated W84.1 (a.k.a. W83.3), W84.2, W85.1, W86.1 and L86. 

Table 7 lists the primer sequences used to obtain these AFLP markers. These 5 
additional AFLP markers raised the total number of AFLP markers to 12 in the region from 
L81.1 to LBS (see Figure 4 and below). 

Table 7 shows primer combinations of AFLP markers derived from YAC clones. 

"EcoRI-" refers to the sequence 5'-GACTGCGTACCAATTC-3', 
"MseI-" refers to the sequence 5'-GATGAGTCCTGAGTAA-3' and 
"PstI-" refers to the sequence 5'-GACTGCGTACATGCAG-3'. 

Table 7 

AFLP marker     Primer combination with selective extensions 
W84.1           PstI-AT      MseI-TT 
W84.2           PstI-AA      MseI-TT 
W85.1           EcoRI-CT     MseI-GTA 
W86.1           EcoRI-GT     MseI-CTT 
L86             EcoRI-GT     MseI-CTT 

This information was used to construct a physical map of the region, as shown in 
Figure 5, with approximate positions of the YAC clones, relative to the genetic map. The 
map showed that the region containing the NIM1 locus, between markers W83.1 and 
W85.1, is partly covered by 3 YAC clones: 12F04 and 10G07/7E03. 

iii. Construction of a P1/BAC contig containing the NIM1 gene. 

In the previous sections it was described how AFLP markers linked to the NIM1 
region were isolated and how YACs corresponding to these markers were identified and 
mapped. The results obtained while localizing the NIM1 gene to a chromosome fragment 
did not allow the definition of a specific DNA segment containing the NIM1 gene: the 
flanking AFLP markers were mapped to different YACs that did not overlap. It was, 
therefore, not possible to determine the precise physical position of the NIM1 gene; it could 
be located on either of the two YACs or in the gap between the YACs. An alternative 
approach was selected to close the physical gap between the flanking markers: a P1 and 
BAC library were employed to bridge the gap between the flanking AFLP markers. 

The libraries used for gap closure were an Arabidopsis ecotype Columbia P1 library 
described by Liu et al. (The Plant J. 7, 351-358, 1995) and an ecotype Columbia BAC library 
described by Choi et al. (http://genome-www.stanford.edu/Arabidopsis/ww/Vol2/choi.html). 
The P1 library consists of about 10,000 clones with an average insert size of 80 kb and the 
BAC library consists of about 4000 clones with an average insert size of 100 kb. In theory, 
these libraries represent about 10 nuclear genome equivalents (assuming a haploid 
genome size for Arabidopsis of 120 Mb). 
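
The figure of about 10 genome equivalents follows directly from the stated clone numbers and insert sizes. As a rough sketch of that arithmetic, the snippet below also estimates the chance that a given locus is absent from both libraries; that second estimate assumes idealized, randomly distributed clones and is not a claim made in the source.

```python
def genome_equivalents(libraries, genome_size_kb):
    """Total cloned DNA divided by the haploid genome size.

    `libraries` is a list of (number_of_clones, average_insert_kb) pairs.
    """
    total_kb = sum(n_clones * insert_kb for n_clones, insert_kb in libraries)
    return total_kb / genome_size_kb


def probability_locus_missing(libraries, genome_size_kb):
    """Chance that no clone covers a given point, assuming random clones.

    Each clone covers the point with probability insert/genome; this
    idealization ignores cloning bias and is an added assumption.
    """
    p_missing = 1.0
    for n_clones, insert_kb in libraries:
        p_missing *= (1.0 - insert_kb / genome_size_kb) ** n_clones
    return p_missing


GENOME_KB = 120_000                          # 120 Mb haploid genome, as assumed in the text
LIBRARIES = [(10_000, 80), (4_000, 100)]     # P1 library, BAC library

coverage = genome_equivalents(LIBRARIES, GENOME_KB)      # 1,200,000 kb / 120,000 kb
p_missing = probability_locus_missing(LIBRARIES, GENOME_KB)
```

Under these assumptions the P1 library contributes 800 Mb and the BAC library 400 Mb of cloned DNA, which together correspond to 10 genome equivalents.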
iv. Identification of P1 clones corresponding to the flanking markers. 

The flanking markers Ws84.2 and Ws85.1 were used to screen pools of P1 clones 
using a similar strategy as previously described for screening of the YAC library (see 
Example 1.5i). P1 clones having the marker fragments were selected and "plasmid" DNA 
was isolated. The various P1 clone DNAs were fingerprinted using the ECs EcoRI/MseI and 
HindIII/MseI and primers without selective nucleotides. A physical map was constructed, i.e. 
a map giving the size and overlaps of the clones, by comparing the AFLP fingerprints. The 
number of AFLP fragments that are unique and the number of AFLP fragments that are 
common between clones indicate the extent of the overlaps. The map is displayed in Figure 
6. The AFLP fingerprinting revealed that two sets of non-overlapping P1 contigs had been 
constructed, each containing one of the flanking markers: P1-1 and P1-2 containing marker 
Ws84.2; P1-3 and P1-4 containing marker W85.1. Consequently, the gap between the 
flanking markers was not closed (Figure 6). 
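
How shared and unique fragment counts translate into an overlap call can be sketched as follows. This is a simplified illustration, not the analysis performed in the source: fingerprints are modeled as sets of hypothetical fragment identifiers, and the toy input merely reproduces the fragment counts reported earlier for YACs 10G07 and 7E03.

```python
def compare_fingerprints(a, b):
    """Count shared and clone-specific AFLP fragments of two fingerprints."""
    return {
        "shared": len(a & b),
        "unique_a": len(a - b),
        "unique_b": len(b - a),
    }


def relation(a, b):
    """Classify the overlap between two clones from their fragment sets."""
    counts = compare_fingerprints(a, b)
    if counts["shared"] == 0:
        return "no overlap detected"
    if counts["unique_a"] == 0 and counts["unique_b"] == 0:
        return "indistinguishable"
    if counts["unique_b"] == 0:
        return "b contained in a"
    if counts["unique_a"] == 0:
        return "a contained in b"
    return "partial overlap"


# Toy fingerprints: integers stand in for fragment identifiers
# (for example primer combination plus fragment mobility).
yac_10g07 = set(range(83))   # 83 fragments in total
yac_7e03 = set(range(62))    # 62 fragments, all of them shared

summary = compare_fingerprints(yac_10g07, yac_7e03)
call = relation(yac_10g07, yac_7e03)
```

A clone with no unique fragments, such as 7E03 in this toy input, is called as contained in the larger clone, while two clones with no shared fragments give no evidence of overlap.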

The positions of the P1 contigs with respect to the YAC contig were determined by 
AFLP fingerprinting of the YACs and P1 clones with a number of YAC-specific PCs 
described above. P1 clones P1-1 and P1-2 appeared to overlap completely with YAC 
CIC12F04, but only partially with YAC CIC12H07. Therefore, the latter P1 clones could be 
positioned on the YAC contig CIC12H07/12F04 (Figure 6). P1 clones P1-3 and P1-4 
overlapped completely with both YACs CIC7E03 and CIC10G07 and it appeared that AFLP 
marker W86.1, like W85.1, was mapped to this P1 contig (Figure 6). 

Next, marker L85 was used to identify corresponding P1 and BAC clones. L85 is an 
ecotype-Landsberg-specific marker and, therefore, colony hybridization of radioactively 
labelled L85 DNA to P1 and BAC filters was employed. Not a single P1 or BAC clone 
hybridizing to L85 was identified. This is consistent with our earlier finding that the L85 
sequence is lacking in the Arabidopsis ecotype Columbia genome, which is the most likely 
explanation of why no corresponding clones were identified. 



v. Extending the NIM1-flanking P1 Contigs. 

Various approaches were employed to extend from the flanking P1 contigs: 

YAC AFLP fragments specific to the South end of YAC CIC12F04 (unique to 
CIC12F04, not present in CIC12H07) were used to identify P1 clones by AFLP screening of 
pools of the library. 

1. YAC AFLP fragments from YAC 10G07 and overlapping with P1-4 were 
used to identify P1 clones by AFLP screening of pools of the P1 library. 

2. EcoRI restriction fragments from P1 clone Pl-s (resulting from the AFLP-based 
P1 library screening of step 1) were used as hybridization probes on 
filters of the BAC library. 

Various P1 and BAC clones resulted from this screening and all were AFLP- 
fingerprinted with the ECs EcoRI/MseI and HindIII/MseI using primers without selective 
nucleotides. A new map was constructed as described above and is depicted in Figure 7. 
Table 8 shows the various AFLP PCs having AFLP fragments mapped to flanking YACs 
and used to screen the P1 library for corresponding P1 clones. 

Table 8 represents the various AFLP PCs used to screen the P1 library. The top half 
of the table shows PCs specific for the North YACs and the bottom half shows the PCs 
specific for the South YACs. Also indicated are the YACs and P1 clones wherein the AFLP 
fragments were detected. 

Table 8 

AFLP PCs               CIC YACs            P1 clones       Comments 
PstI-AA MseI-TT        12F04 and 12H07     P1-1, P1-2      Marker Ws84.2 
PstI-AT MseI-GT        12F04-specific      P1-1, P1-2 
PstI-CA MseI-AA        12F04-specific      P1-6 
PstI-AC MseI-TG        12F04-specific      P1-7 
PstI-AG MseI-TG        12F04-specific      P1-7 
PstI-CT MseI-GT        12F04-specific      P1-7 

EcoRI-CT MseI-GTA      10G07 and 7E03      P1-3, P1-4      Marker Ws85.1 
EcoRI-GT MseI-CTT      10G07 and 7E03      P1-4            Marker Ws86.1 
EcoRI-AA MseI-GT       10G07 and 7E03      P1-4, P1-9 
EcoRI-AT MseI-GA       10G07 and 7E03      P1-4, P1-9 
EcoRI-GG MseI-CT       10G07 and 7E03      P1-4, P1-9 

A P1/BAC contig of about 250 kb was obtained covering the South end of YAC CIC12F04 
(not extending from this YAC) and containing marker W84.2. A P1 contig of about 150 kb 
containing markers W85.1 and W86.1 was obtained; this contig is completely contained 
within YAC CIC7E03. 

Construction of a P1/BAC contig covering the NIM1 gene. 

AFLP marker analysis on the recombinants with markers from the South end of the 
North P1/BAC contig (WL84.4 and WL84.5, see below and Table 11) showed that the 
previous "walking" steps were unsuccessful in the construction of a contig containing the 
NIM1 gene (see next section). Therefore, the existing North P1/BAC contig was extended 
South with the purpose of "walking" across the NIM1 gene, which would enable the 
definition and isolation of a specific DNA segment containing the NIM1 gene. A 
hybridization-based approach was followed in which P1 or BAC clones located at the South 
end of the North P1/BAC contig were used to identify clones positioned closer to NIM1 
(South bound). New clones resulting from the walking steps were mapped with respect to 
the existing contigs using AFLP fingerprinting with the ECs EcoRI/MseI and HindIII/MseI as 
described above. A total of 5 subsequent walking steps appeared to be necessary to 
"cross" the gene. Table 9 shows the clones obtained in the various walking steps. 

Table 9 is an overview of the various walking steps showing the hybridization probe 
used to screen the P1 and BAC libraries and the selected clones hybridizing to the probes 
and extending in the South direction. 



Table 9 

           Hybridization probe    New clones extending South 
Step 1:    P1-7                   BAC-02 
Step 2:    BAC-02                 P1-16, BAC-03 
Step 3:    BAC-03                 P1-17, P1-18 
Step 4:    P1-18                  P1-21, P1-20, BAC-04 
Step 5:    BAC-04                 P1-22, P1-23, P1-24, BAC-06, BAC-05 



A physical map of the various clones resulting from this walking effort is depicted in 
[...] marker W84.. The South end of the contig presented in Figure 8 appeared to contain the 
NIM1 gene (see next section). The contig extends more than 300 kb South from YAC 
CIC12F04 and appeared not to overlap with YACs CIC10G07 and CIC7E03, indicating that 
the NIM1 gene is in the gap between the flanking YAC contigs and that this gap is at least 
300 kb. 

vi. Construction of an Integrated Genetic and Physical Map of the NIM1 Region. 

In the previous sections it was described how AFLP markers linked to the NIM1 
region were isolated, how YACs corresponding to the flanking markers were identified, and 
how a P1/BAC contig was constructed extending about 550 kb South from the closest North 
flanking AFLP marker W84.2. This section describes the generation of new AFLP markers 
from the P1/BAC contig, the physical mapping of these markers on this contig and the 
genetic mapping of these markers with the available recombinants. 

1. Generation of New AFLP Markers from the P1/BAC Contig 

As described in the previous section, the P1 and BAC clones of the contig extension 
were characterized by AFLP fingerprinting using the ECs EcoRI/MseI and HindIII/MseI. This 
defined quite accurately the extent of the overlaps between the various P1 and BAC clones 
and, in addition, generated a number of AFLP fragments specific for these clones. AFLP 
primers without selective nucleotides are used in fingerprinting of purified plasmid DNA of 
the P1 or BAC clones. Selective nucleotides will be necessary, however, to be able to use 
these P1- or BAC-specific AFLP fragments for detection in Arabidopsis. By determining the 
end sequences of the amplified restriction fragments, AFLP primers having the appropriate 
selective bases can be designed to amplify the P1- or BAC-specific AFLP fragment in 
Arabidopsis. All AFLP fragments originate from the ecotype Columbia (Col) and, therefore, 
it also should be determined if the Columbia AFLP markers are informative in the NIM1 
recombinants, which are derived from a cross of the ecotypes Landsberg erecta (Ler) and a 
nim1 mutant of the ecotype Wassilewskija (Ws-nim). In principle, there are 4 types of AFLP 
fragments, two of which are useful markers as indicated in Table 10 below: 

Table 10 is an overview of the types of AFLP markers found. (+) or (-) indicates 
presence or absence of the AFLP fragment. 

Table 10 

Col     Ler     Ws-nim     marker-type 
+       -       -          not informative 
+       +       -          Ler marker 
+       -       +          Ws marker 
+       +       +          not informative 
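
The classification of Col-derived fragments into marker types can be written as a small decision rule. The sketch below only restates the definitions used in the text (a Ler marker is a Col fragment present in Ler and absent in Ws, a Ws marker the reverse); the function name and the boolean encoding of presence and absence are illustrative assumptions.

```python
def marker_type(col, ler, ws_nim):
    """Classify a Col-derived AFLP fragment by its presence in Ler and Ws-nim.

    Each argument is True when the fragment is detected in that ecotype.
    """
    if not col:
        # All fragments under consideration originate from Columbia clones.
        raise ValueError("fragment must be present in Col")
    if ler and not ws_nim:
        return "Ler marker"
    if ws_nim and not ler:
        return "Ws marker"
    # Present in both parents or absent from both: the fragment cannot
    # distinguish the parental alleles in the recombinants.
    return "not informative"


# Enumerate the four possible fragment types for a Col fragment.
table = {(ler, ws): marker_type(True, ler, ws)
         for ler in (False, True) for ws in (False, True)}
```

Only the two fragment types that differ between the parents of the cross are usable for scoring recombinants.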



[...] AFLP fragments and 60 to 80 HindIII/MseI AFLP fragments for each individual clone. The 
end sequences of individual fragments were determined by standard sequencing 
techniques. Next, specific AFLP primer sets with selective extensions of 3 nucleotides for 
both the EcoRI or HindIII primer and the MseI primer were tested on the following panel of 
DNAs: 

1. P1/BAC clone from which the AFLP marker was derived 
2a. Yeast 
2b. YAC clone CIC12F04 (only for AFLP fragments from P1-7) 
2c. YAC clone CIC10G07 
3a. Col, origin of the P1 and BAC libraries 
3b. Ler, parent 1 of the nim recombinants 
3c. Ws-nim, parent 2 of the nim recombinants 

[...] HindIII/MseI AFLP fragments: BAC-01/P1-7, P1-17/P1-18, BAC-04/BAC-06. The AFLP 
fragments from clone P1-7 were all detected in YAC CIC12F04, indicating that this clone is 
completely contained within this YAC. None of the P1/BAC-specific AFLP fragments was 
detected in YAC clone CIC10G07, indicating that the P1/BAC contig does not bridge the 
gap between the two flanking YAC contigs. AFLP markers selected for analysis of the nim 
recombinants are depicted in Table 11. 

Table 11 is an overview of the selected AFLP markers from the AFLP PCs specific 
for the various P1 and BAC clones. A "WL" marker is a marker originating from the same 
PC and displaying two AFLP markers, a Ws and a Ler marker, which appeared to be 
completely linked in repulsion phase upon analysis of the nim recombinants. 

Table 11 

Origin          Marker name     Primer combination 
P1-7            WL84.4 
P1-7            WL84.5 
P1-17/P1-18     Ler84.6a        HindIII-???  MseI-ATT 
P1-17/P1-18     Ler84.6b        HindIII-ATT  MseI-CAT 
P1-18           Ler84.6c        HindIII-TCT  MseI-TAT 
P1-18           Ler84.7         EcoRI-AAA    MseI-AGA 
BAC-04/06       Ler84.8         EcoRI-TTC    MseI-AGT 
BAC-06          Ler84.9a        EcoRI-AAA    MseI-TGT 
BAC-06          Ler84.9b        EcoRI-ATC    MseI-TCC 
BAC-06          Ler84.9c        EcoRI-ATG    MseI-GTA 



2. Physical Mapping of the New AFLP Markers. 

The AFLP markers described above were physically mapped by detecting their 
presence in the various P1 and BAC clones. The results are presented in Figures 9-11. 

3. Genetic Mapping of the New AFLP Markers. 

The AFLP markers were all analyzed on a selected set of the recombinants. The 
results obtained are summarized in Tables 12a, 12b and 12c. 

The AFLP markers Ler84.8, Ler84.9a, Ler84.9b and Ler84.9c appeared to map at 
the South side of NIM1. Recombinants were found that were phenotypically nim1 
(homozygous, genotype Ws-nim1/Ws-nim1) and heterozygous for these AFLP markers (the 
Ler-specific AFLP marker was detected, genotype is Ws-nim1/Ler). AFLP marker Ler84.8 
appeared to be closest to NIM1: only a single recombinant (C-105) was scored as 
heterozygous Ws-nim1/Ler at this marker and homozygous Ws-nim1/Ws-nim1 at NIM1. 
AFLP markers Ler84.7 and Ler84.6c appeared to completely cosegregate with NIM1: all 
recombinants had an identical NIM1 and AFLP marker genotype. North of NIM1, marker 
L84.6b appeared to be closest to NIM1: three nim1 phenotype recombinant plants, C-074, 
D-169 and E-103 (Table 12c), were found to be heterozygous Ws-nim1/Ler at this marker. 
With the aid of the cosmid contig generated from P1-18, BAC-04 and BAC-06, AFLP 
markers Ler84.6b and Ler84.8 were mapped in P1-18 and BAC-04, respectively, and found 
to have a physical distance of approximately 110 kb. This defines nim1 to be located on a 
DNA segment estimated to be 110 kb in length. From this analysis it has been determined 
that the NIM1 gene is contained in clone BAC-04 or P1-18. Clones BAC-04 and P1-18 have 
been deposited with ATCC and given deposit numbers ATCC 97543 and ATCC 97606, 
respectively. 

vii. Genetic and Physical Fine Mapping of the NIM1 Gene. 

The previous section described how a DNA segment containing the NIM1 gene was 
delineated by physical mapping of the flanking AFLP markers (Ler84.6b and Ler84.8) on 
the P1/BAC contig. The flanking markers appeared to map on two overlapping clones, 
P1-18 and BAC-04. This section describes how additional BAC-04-specific and P1-18-specific 
AFLP markers were generated to increase the resolution of the genetic and physical map in 
the region containing the NIM1 gene. 

viii. Generation of Additional AFLP Markers from the Cosmid Array. 

Four ECs were selected to generate additional AFLP markers for fine mapping of 
NIM1: PstI/MseI, XbaI/MseI, BstYI/MseI and TaqI/MseI. PstI/MseI and XbaI/MseI AFLP 
fragments were generated on clones P1-18 and BAC-04 and the selective sequences 
necessary for detection in Arabidopsis were determined. Similarly, the AFLP fragments and 
selective sequences were determined for BstYI/MseI and TaqI/MseI; however, in this case 
the procedure was performed using cosmid DNAs: A11, C7, E1 and E8 for BstYI/MseI 
(complete region), and D7, E8 and E6 for TaqI/MseI (South side of NIM1 region). 
Informative AFLP markers selected for further genetic and physical mapping are shown in 
Table 13. Additional adapters used in this work are shown in Table 14. 



Table 13 

Marker       EC/PC 
Ler84.Y1     BstYI(T)-GCT     MseI-AAC 
Ws84.Y2      BstYI(T)-TCT     MseI-GCA 
Ler84.Y3     BstYI(T)-AAG     MseI-TAT 
Ler84.Y4     BstYI(T)-GTT     MseI-AGA 
Ws84.T1      TaqI-TAC         MseI-GGA 
Ler84.T2     TaqI-TTG         MseI-GGA 

Table 14 shows the additional adapters used for identifying new AFLP markers. 

Table 14 

BstYI:  5'-CTCGTAGACTGCGTACC-3' 
           3'-CATCTGACGCATGGCTAG-5' 

TaqI:   5'-CTCGTAGACTGCGTACC-3' 
           3'-CATCTGACGCATGGGC-5' 



ix. Physical mapping of new AFLP markers to the cosmid contig. 
The markers indicated above were physically mapped on the cosmid array by 
determining their presence in the various cosmid clones (Figure 11). 

1. Genetic Mapping of New AFLP Markers. 

The new AFLP markers were genetically mapped by AFLP analysis of the closest 
North and South recombinants. The closest North (recombinant D169) and South 
(recombinant C105) recombination points were mapped (see Table 15). The AFLP analysis 
showed that recombinant D169 had a recombination South of marker L84.Y1, but North of 
marker W84.Y2. The recombination point in recombinant C105 mapped between markers 
L84.T2 and L84.8. Using the available set of recombinants this allowed further delineation 
of the chromosomal interval containing NIM1; the distance between the flanking 
recombination points appeared to be 60-90 kb (Figure 12). 

2. Construction of a Cosmid Contig. 

For complementation of the nim1 plant phenotype, transformation of nim1 plants 
with a wildtype NIM1 gene is required. This can be accomplished by transforming these 
plants with a cosmid containing the gene. For this purpose, a cosmid contig of the NIM1 
region is constructed. Since Arabidopsis is transformed using Agrobacterium, the cosmid 
vector used is a binary vector. 

DNA was isolated from BAC-04, BAC-06 and P1-18, and used to make a partial 
digest using restriction enzyme Sau3AI. The 20-25 kb fragments were isolated using a 
sucrose gradient, pooled, and filled in with dATP and dGTP. The binary vector (04541) 
was cleaved with XhoI and filled in with dCTP and dTTP. The fragments were next ligated 
into the vector. The ligation mix was packaged and transduced into E. coli. 
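
Why the two partial fill-in reactions produce ends that can be ligated to each other can be checked with a short sketch. It assumes the standard 5' overhangs left by Sau3AI (GATC) and XhoI (TCGA), which the text does not spell out, and it models the polymerase simply as extending the recessed 3' end until a required nucleotide is not supplied; the function names are hypothetical.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def partial_fill_in(overhang, dntps):
    """Return the 5' overhang left after fill-in with a restricted dNTP set.

    `overhang` is written 5'->3'. The recessed 3' end is extended by copying
    the overhang from its 3'-most base toward its 5' end; synthesis stops at
    the first template base whose complement is not among `dntps`.
    """
    remaining = overhang
    while remaining and COMPLEMENT[remaining[-1]] in dntps:
        remaining = remaining[:-1]
    return remaining


def reverse_complement(seq):
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def can_anneal(end_a, end_b):
    """Two 5' overhangs anneal when one is the reverse complement of the other."""
    return bool(end_a) and reverse_complement(end_a) == end_b


# Assumed overhangs: Sau3AI leaves 5'-GATC, XhoI leaves 5'-TCGA.
insert_end = partial_fill_in("GATC", {"A", "G"})   # inserts filled with dATP + dGTP
vector_end = partial_fill_in("TCGA", {"C", "T"})   # vector filled with dCTP + dTTP

insert_to_vector = can_anneal(insert_end, vector_end)
insert_to_insert = can_anneal(insert_end, insert_end)
vector_to_vector = can_anneal(vector_end, vector_end)
```

Under these assumptions the two-base overhangs of insert and vector are complementary to each other but not to themselves, which favors insert-to-vector ligation over self-ligation of either component.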

This cosmid library was screened with the BAC-04, BAC-06 and P1-18 clones and 
positive clones isolated. These cosmids were next AFLP fingerprinted and arranged into a 
contig of overlapping clones spanning the NIM1 region. The insert sizes of the cosmids 
were determined, and limited restriction mapping was performed. The results are shown in 
Figure 10. 

Example 2 

Identification of a Clone Containing the NIM1 gene. 
1. Complementation Via Stable Transformation 

Cosmids that are generated from clones that span the NIM1 region (described 
above) are moved into Agrobacterium by triparental mating. These cosmids are then used 
to transform nim1 Arabidopsis by vacuum infiltration (Mindrinos et al., 1994, Cell 78, 
1089-1099) or by standard root transformation. Seed from these plants is harvested and 
allowed to germinate on agar plates with kanamycin (or another appropriate antibiotic) as 
selection agent. Only plantlets that are transformed with cosmid DNA can detoxify the 
selection agent and survive. Seedlings that survive the selection are transferred to soil and 
tested for the nim phenotype or their progeny are tested for the nim phenotype. 
Transformed plants that no longer have the nim phenotype identify cosmid(s) that contain a 
functional NIM1 gene. 

2. Complementation in a Transient Expression System. 

The ability of DNA clones to complement the nim1 mutation is tested in 2 transient 
expression systems. 

In the first system, nim1 Arabidopsis plants containing a PR1-luciferase (PR1-lux) 
transgene are used as bombardment recipient material. These plants are generated by 
transforming Columbia ecotype plants with a PR1-lux construct by vacuum infiltration, 
followed by kanamycin selection of the harvested seed, as described above. Transformed 
plants that express luciferase activity after induction with INA are selfed and homozygous 
plants are generated. These are crossed to nim1 plants. In the transient assay, progeny 
plants from this cross that are homozygous for nim1 and for PR1-lux are used for 
identification of DNA clones that can complement the nim1 phenotype. To this end, the 
plants are first treated with INA, as described in Example 1.1 above. Two days later these 
plants are harvested, surface sterilized and plated on GM agar medium. The leaf tissue is 
then bombarded with cosmid, P1 or BAC clones (or subclones) from the NIM1 region and 
after one day, the luciferase activity of the leaves is measured. Clones that induce 
luciferase activity contain the NIM1 gene. 

In a second system, nim1 plants are treated with INA (as described in Example 1.1 
above) and 2 days later bombarded with cloned DNA (cosmid, P1, BAC and/or YAC clones 
or subclones) from the NIM1 locus region and a reporter plasmid. The reporter plasmid 
contains the luciferase gene, driven by the Arabidopsis PR1 promoter (PR1-lux). In nim1 
plants, INA does not activate the PR1 promoter (as described in Example 1.2 above) and 
thus cannot induce luciferase activity from the reporter plasmid. However, when a 
cotransformed DNA clone contains the complementing NIM1 gene, INA does induce the 
PR1 promoter, as evidenced by an induction of luciferase activity. One day after the 
cobombardment, the luciferase activity of the whole plant is measured. DNA clones 
(cosmids, P1 or BAC clones or subclones) that induce luciferase activity that is significantly 
above background levels contain the NIM1 gene. 

3. Changes in transcripts in nim1 phenotype lines. 

Since nim1 phenotype plants have mutations in the NIM1 gene, it is conceivable that 
in some lines the gene is altered in such a manner that there is no mRNA transcribed, or an 
aberrant mRNA (size) is produced. To test for this, RNA blot analysis is performed on the 
nim1 lines. 

RNA is isolated from Ws and Ler plants of these lines (after water or INA or BTH 
treatment) and used to prepare northern blots. These blots are hybridized with DNA 
fragments isolated from clones of the DNA contig of the NIM1 locus. DNA fragments that 
identify nim1 lines with aberrant RNA expression (aberrant in size or concentration) likely 
identify (part of) the NIM1 gene. The DNA fragment and surrounding DNA is sequenced 
and used to isolate a cDNA (by library screen or by reverse transcription-PCR), which is 
also sequenced. The clone from which the fragment was isolated or the isolated cDNA is 
used to show complementation of the nim1 phenotype in stable and transient expression 
systems. 


Example 3 

Determination of the DNA sequence of the NIM1 gene. 
1. Genomic sequencing. 

Genomic clones that may contain the NIM1 gene are sequenced using methods 
known in the art. These include BAC-04, P1-18 and the cosmids from the NIM1 region. For 
instance, the cosmids are digested with restriction enzymes and fragments that are derived 
from the insert are cloned into a general purpose vector, such as pUC18 or Bluescript. The 
larger P1 and BAC clones are randomly sheared and fragments cloned into a general 
purpose vector. The fragments in these vectors are sequenced by conventional methods 
(e.g. by "primer walking" or generation of deletions of inserts). The obtained sequences are 
assembled into a contiguous sequence. 

The sequence of the insert of a complementing clone contains the NIM1 gene. The 
approximate start and end of the gene are deduced based on the DNA sequence, 
sequence motifs such as TATA boxes, the open reading frames present in the sequence, 
codon usage, the cosmid complementation data, the relative location of the AFLP markers 
and additional relevant data that is gathered (see Example 4, below). 

2. cDNA sequencing. 

The cosmid(s) or larger clones that contain the NIM1 gene (as described in Example 
2 above) are used to isolate cDNAs. This is accomplished by using the clones (or DNA 
fragments) as probes in a screen of a cDNA library of wildtype Arabidopsis plants. The 
cDNAs that are isolated are sequenced as described for cosmid sequencing and used in 
complementation tests. To this end, full length cDNAs are cloned into a suitable plant 
expression vector, behind a constitutive promoter. These constructs are used in the 
transient assays as described above. Alternatively, the cDNAs are cloned into a binary 
expression vector, allowing for expression in plant tissues and for Agrobacterium-mediated 
plant transformation, as described in Example 2 above. A cDNA that contains the NIM1 
gene (as determined by complementation, isolation with a closely linked AFLP marker, 
isolation with a cosmid fragment, or by other deduction) is sequenced. 

The genes from Ws-0 and nim1 plants are isolated and sequenced. The genes are 
obtained from a cosmid or cDNA library, using a fragment of the isolated NIM1 gene as a 
probe. Alternatively, the genes or cDNAs are isolated by PCR, using NIM1-gene-specific 
primers and genomic DNA or cDNA as template. Likewise, the nim1 alleles from other nim1 
lines (see Example 1.1 above) are isolated and sequenced in a similar manner. 

Example 4 

Description of the NIMI gene and deduced protein sequence 
The DNA sequence of the NIMI gene or cDNA is determined as described in 
Example 3 above. This sequence is analyzed with the use of DNA analysis programs, such 
as can be found in the Genetics Computer Group (GCG) package, in the Sequencer or 
Staden packages, or any similar DNA analysis program package. 



SUBSTITUTE SHEET (RULE 26) 



wo 97/49822 



PCT/EP97/01218 



-62 



Specifically, the start and end of the gene are determined, based on open reading
frame analysis, the presence of stop and potential start codons, the presence of potential
promoter motifs (such as the TATA box), the presence of polyadenylation signals and the
like. Also, the predicted amino acid sequence is deduced from the open reading frame. Both
the DNA and protein sequence are used to search databases for sequences with homologies,
such as transcription factors, enzymes or motifs of such genes or proteins.
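
The open reading frame analysis described in this Example can be illustrated with a
short sketch. It is not the algorithm of the GCG, Sequencher or Staden packages; it is a
minimal six-frame scan for ATG-to-stop stretches, and the function names, the min_codons
parameter and the toy sequence are assumptions made only for illustration.

```python
# Minimal six-frame ORF scan (illustrative sketch, not a packaged algorithm).
COMPLEMENT = str.maketrans("ACGT", "TGCA")
STOP_CODONS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_orfs(seq, min_codons=2):
    """Return (strand, frame, start, end) for each ATG...stop ORF.

    start and end are 0-based, end-exclusive coordinates on the strand that
    was scanned; end includes the stop codon. min_codons counts the ATG and
    the following coding codons, not the stop codon.
    """
    orfs = []
    for strand, s in (("+", seq.upper()), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first potential start codon in this frame
                elif start is not None and codon in STOP_CODONS:
                    if (i - start) // 3 >= min_codons:
                        orfs.append((strand, frame, start, i + 3))
                    start = None  # look for the next ORF in this frame
    return orfs


toy_sequence = "CCATGAAATTTTAGGG"  # hypothetical sequence, not from Figure 14
print(find_orfs(toy_sequence))
```

In practice a minimum ORF length far larger than two codons would be used, and promoter
and polyadenylation motifs would be searched separately.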



Example 5 
Isolation of NIM1 homologs
The NIM1 gene may be used as a probe in the low stringency hybridization screening
of a genomic or cDNA library in order to isolate NIM1 homologs from other plant species.
Alternatively, this is accomplished by PCR amplification, using primers designed based on
the Arabidopsis NIM1 gene sequence and using genomic DNA or cDNA as template. The NIM1
gene may be isolated from corn, wheat, rice, barley, rapeseed, sugarbeet, potato, tomato,
bean, cucumber, grape, tobacco and other crops of interest and sequenced. With a set of
sequences from NIM1 gene homologs in hand, new primers can be designed from conserved
portions of the gene, in order to isolate NIM1 homologs from more distantly related plant
species by PCR amplification.

Example 6 

Complementation of the nim1-1 gene with genomic fragments.
1. Construction of a cosmid contig.

A cosmid contig of the NIM1 region was constructed using CsCl-purified DNA from
BAC04, BAC06 and P1-18. The DNAs of the three clones were mixed in equimolar
quantities and were partially digested with the restriction enzyme Sau3A. The 20-25 kb
fragments were isolated using a sucrose gradient, pooled and filled in with dATP and dGTP.
Plasmid pCLD04541 was used as T-DNA cosmid vector. This plasmid contains a broad
host range pRK290-based replicon, a tetracycline resistance gene for bacterial selection
and the nptII gene for plant selection. The vector was cleaved with XhoI and filled in with
dCTP and dTTP. The prepared fragments were then ligated into the vector. The ligation
mix was packaged and transduced into E. coli strain XL1-Blue MR (Stratagene). Resulting
transformants were screened by hybridization with the BAC04, BAC06 and P1-18 clones
and positive clones isolated. Cosmid DNA was isolated from these clones and template
DNA was prepared using the ECs EcoRI/MseI and HindIII/MseI. The resulting AFLP
fingerprint patterns were analyzed to determine the order of the cosmid clones. A set of 15
semi-overlapping cosmids was selected spanning the nim1 region (Figure 13). The cosmid
DNAs were also restricted with EcoRI, PstI, BssHII and SgrAI. This allowed for the
estimation of the cosmid insert sizes and the verification of the overlaps between the
various cosmids as determined by AFLP fingerprinting.

2. Identification of a clone containing the NIM1 gene.

Cosmids generated from clones spanning the NIM1 region were moved into
Agrobacterium tumefaciens AGL-1 through conjugative transfer in a tri-parental mating with
helper strain HB101 (pRK2013). These cosmids were then used to transform a kanamycin-sensitive
nim1-1 Arabidopsis line using vacuum infiltration (Mindrinos et al., 1994, Cell 78,
1089-1099). Seed from the infiltrated plants was harvested and allowed to germinate on
GM agar plates containing 50 µg/ml kanamycin as a selection agent. Only plantlets that
are transformed with cosmid DNA can detoxify the selection agent and survive. Seedlings
that survived the selection were transferred to soil approximately two weeks after plating and
tested for the nim1 phenotype as described below. Transformed plants that no longer have
the nim1 phenotype identify cosmid(s) that contain a functional NIM1 gene.

3. Testing for the nim1 phenotype of transformants.

Plants transferred to soil were grown in a phytotron for approximately one week after
transfer. 300 µM INA was applied as a fine mist to completely cover the plants using a
chromister. After two days, leaves were harvested for RNA extraction and PR-1 expression
analysis. The plants were then sprayed with Peronospora parasitica (isolate EmWa) and
grown under high humidity conditions in a growing chamber with 19°C day/17°C night
temperatures and 8h light/16h dark cycles. Eight to ten days following fungal infection,
plants were evaluated and scored positive or negative for fungal growth. Ws and nim1
plants were treated in the same way to serve as controls for each experiment.

Total RNA was extracted from the collected tissue using a LiCl/phenol extraction
buffer (Verwoerd et al., NAR 17:2362). RNA samples were run on a formaldehyde agarose
gel and blotted to GeneScreen Plus (DuPont) membranes. Blots were hybridized with a
32P-labeled PR-1 cDNA probe. The resulting blots were exposed to film to determine which
transformants were able to induce PR-1 expression after INA treatment. The results are
summarized in Table 16.

Table 16 shows complementation of the nim1 phenotype by cosmid clones.
Table 16

Clone Name                  # of transformants   # of plants with INA-induced PR-1 /
                                                 total # of plants tested (%)
A8                          3                    0/3 (0%)
A11                         8                    4/18 (22%)
C2                          10                   1/10 (10%)
C7                          33                   1/32 (3%)
D2                          81                   4/49 (8%)
D5                          6                    5/6 (83%)
E1                          10                   10/10 (100%)
D7                          129                  36/36 (100%)
E8                          9                    0/9 (0%)
F12                         6                    0/6 (0%)
E6                          1                    0/1 (0%)
E7                          34                   0/4 (0%)
Ws control (wild-type)      NA                   28/28 (100%)
nim1-1 phenotype control    NA                   0/34 (0%)

NA = not applicable
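
The percentages in Table 16 follow directly from the counts, and the clones that restore
INA-induced PR-1 expression in every plant tested can be picked out mechanically. The
sketch below is only an illustration of that arithmetic; the function names and the
100% threshold are assumptions, and the counts are transcribed from the table.

```python
# (clone, plants with INA-induced PR-1, plants tested), transcribed from Table 16.
TABLE_16 = [
    ("A8", 0, 3), ("A11", 4, 18), ("C2", 1, 10), ("C7", 1, 32),
    ("D2", 4, 49), ("D5", 5, 6), ("E1", 10, 10), ("D7", 36, 36),
    ("E8", 0, 9), ("F12", 0, 6), ("E6", 0, 1), ("E7", 0, 4),
]


def complementation_rate(induced, tested):
    """Percentage of tested plants showing INA-induced PR-1 expression."""
    return 100.0 * induced / tested


def fully_complementing(rows, threshold=100.0):
    """Names of clones whose complementation rate reaches the threshold."""
    return [
        name
        for name, induced, tested in rows
        if complementation_rate(induced, tested) >= threshold
    ]


for name, induced, tested in TABLE_16:
    print(f"{name}: {induced}/{tested} ({complementation_rate(induced, tested):.0f}%)")
print("Complementing in all plants tested:", fully_complementing(TABLE_16))
```

With these counts only cosmids E1 and D7 reach 100%, which matches the wild-type control.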


Example 7 





Sequencing of the 9.9 kb NIM1 gene region.
BAC04 DNA (25 µg, obtained from KeyGene) was the source of DNA used for
sequence analysis. This BAC was shown to be the clone completely encompassing the
region that complemented the nim1 mutants. DNA was randomly sheared using an
approach from Cold Spring Harbor. Briefly, BAC DNA was sheared in a nebulizer to an
average molecular weight of about 2 kb. Ends of the sheared fragments were repaired
using a two-step protocol with dNTPs, T4 DNA polymerase and Klenow fragment
(Boehringer). The end-repaired DNA was run on a 1% low-melt agarose gel and the region
between 1.3 kb and 2.0 kb was cut from the gel. DNA was isolated from the gel fragment
by a freeze-thaw approach. DNA was then mixed with EcoRV-digested pBRKanF4 and
was ligated overnight at 4°C. pBRKanF4 is a derivative of pBRKanF1, which was obtained
from Kolavi Bhat at Vanderbilt University (Bhat, K.S., Gene 134(1), 83-87 (1993)). E. coli
strain DH5α was transformed with the ligation mix, and the transformation mix was plated
onto plates containing kanamycin and X-gal. 1600 white or light blue KanR colonies were
selected for plasmid isolation. Individual colonies were picked into 96-well deep well plates
(Polyfiltronics, #US08) containing 1.5 ml of TB + Kan (50 µg/ml). Plates were covered and
were placed on a rotating platform shaker at 37°C for 16 hrs. Plasmid DNA was isolated
using the Wizard Plus 9600 Miniprep system (Promega, #A7000) according to
manufacturer's recommendations.

Plasmids were sequenced using Dye Terminator chemistry (Applied Biosystems
PRISM Dye Terminator Cycle Sequencing Ready Reaction Kit, P/N 402078) and primers
designed to sequence both strands of the plasmids. Data was collected on ABI 377 DNA
sequencers. Approximately 75% of these reactions yielded useful sequence information.
Sequences were edited and were assembled into contigs using Sequencher 3.0 (Gene
Codes Corporation), Staden gap4 (Roger Staden, e-mail address rs@mrc-lmb.cam.ac.uk),
and PHRED (Phil Green, e-mail address phg@u.washington.edu). The largest contig
(approximately 76 kb) covered the complementing region to an average depth of 7
independent calls/base.

A region of approximately 9.9 kb defined by the overlap of cosmids E1 and D7 was
identified by complementation analysis to contain the nim1 region. Primers which flanked
the insertion site of the vector and were specific to the cosmid backbone were designed using
Oligo 5.0 Primer Analysis Software (National Biosciences, Inc.). DNA was isolated from
cosmids D7 and E1 using a modification of the ammonium acetate method (Traynor, P.L.,
1990, BioTechniques 9(6): 676). This DNA was directly sequenced using the Dye Terminator
chemistry described above. The sequence obtained allowed determination of the endpoints
of the complementing region.

A truncated version of the BamHI-EcoRV fragment was also constructed, resulting in
a construct which contains none of the "Gene 3" region (Fig. 13). The following approach
was necessary due to the presence of HindIII sites in the Bam-Spe region of the DNA. The
BamHI-EcoRV construct was completely digested with SpeI, then was split into two
separate reactions for double digestion. One aliquot was digested with BamHI, the other
with HindIII. A BamHI-SpeI fragment of 2816 bp and a HindIII-SpeI fragment of 1588 bp were
isolated from agarose gels (QIAquick Gel Extraction Kit) and were ligated to
BamHI-HindIII-digested pSGCG01. DH5α was transformed with the ligation mix. Resulting
colonies were screened for the correct insert by digestion with HindIII following preparation
of DNA using Wizard Magic MiniPreps (Promega). A clone containing the correct construct was
electroporated into Agrobacterium strain GV3101 for transformation of Arabidopsis plants.

Example 8

Identification of the NIM1 gene region by allele sequencing.



Table 17. Genetic segregation of non-inducible immunity mutants.

                                                     Phenotype
Mutant      Generation   Female       Male          Wildtype   nim1
nim1-1*     F1           wildtype     nim1-1        24         0
            F2                                      98         32
nim1-2      F1           nim1-2       wildtype      3          0
nim1-3      F1           nim1-3       wildtype      3          0
nim1-4      F1           nim1-4       wildtype      3          0
nim1-5      F1           nim1-5       wildtype      0          35
nim1-6      F1           nim1-6       wildtype      3          0
nim1-2      F1           nim1-2       nim1-1        0          15
nim1-3      F1           nim1-3       nim1-1        0          10
nim1-4      F1           nim1-4       nim1-1        0          15
nim1-5      F1           nim1-5       nim1-1        0          14
            F2                                      9          85
nim1-6      F1           nim1-6       nim1-1        0          12

* Data from Delaney et al. (1995) PNAS 92, 6602-6606.
Wildtype denotes the wildtype Ws-0 strain.



1. Genetic Analyses

To determine dominance of the various mutants that displayed the nim1 phenotype,
pollen from wildtype plants was transferred to the stigmata of nim1-1, -2, -3, -4, -5 and -6.
If the mutation is dominant, then the nim1 phenotype will be observed in the resulting F1
plants. If the mutation is recessive, then the resulting F1 plants will exhibit a wildtype
phenotype.

The data presented in Table 17 show that when nim1-1, -2, -3, -4 and -6 are crossed
with the wildtype, the resulting F1 exhibit the wildtype phenotype. Thus, these mutations
are recessive. In contrast, the nim1-5 x wildtype F1 progeny all exhibit the nim1 phenotype,
indicating that this is a dominant mutation. Following INA treatment, no P. parasitica
sporulation was observed on wildtype plants, while the F1 plants supported growth and
some sporulation of P. parasitica. However, the nim1 phenotype in these F1 plants was
less severe than that observed when nim1-5 was homozygous.
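
As an illustration of how the segregation data can be checked, the F2 counts reported
for nim1-1 in Table 17 (98 wildtype : 32 nim1) can be compared with the 3:1 ratio expected
for a single recessive mutation. The goodness-of-fit test below is not described in this
Example; it is a minimal sketch, and the function name and the use of the 5% critical value
for one degree of freedom (3.841) are assumptions made for illustration.

```python
def chi_square(observed, expected_ratio):
    """Pearson chi-square statistic for observed counts against a ratio."""
    total = sum(observed)
    ratio_sum = sum(expected_ratio)
    expected = [total * r / ratio_sum for r in expected_ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Critical value of the chi-square distribution, 1 degree of freedom, p = 0.05.
CRITICAL_DF1_P05 = 3.841

f2_counts = (98, 32)  # wildtype, nim1 (nim1-1 F2 row of Table 17)
statistic = chi_square(f2_counts, (3, 1))
consistent_with_3_to_1 = statistic < CRITICAL_DF1_P05

print(f"chi-square = {statistic:.4f}")
print("consistent with 3:1 segregation:", consistent_with_3_to_1)
```

A statistic well below the critical value means the counts give no reason to reject
3:1 segregation, which is what a single recessive locus would produce.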

To determine allelism, pollen from the kanamycin-resistant nim1-1 mutant plants was
transferred to the stigmata of nim1-2, -3, -4, -5 and -6. Seeds resulting from the cross were
plated onto Murashige-Skoog B5 plates containing kanamycin at 25 µg/ml to verify the
hybrid origin of the seed. Kanamycin resistant (F1) plants were transferred to soil and
assayed for the nim1 phenotype. Because the F1 progeny of the cross of the nim1-5
mutant with the Ws wildtype displayed a nim1 phenotype, analysis of nim1-5 x nim1-1 F2
was also carried out.

As shown in Table 17, all of the resulting F1 plants exhibited the nim1 phenotype.
Thus, the mutation in nim1-2, -3, -4, -5 and -6 was not complemented by nim1-1; these
plants all fall within the same complementation group and are therefore allelic. F2 progeny
from the nim1-5 x nim1-1 cross also displayed the nim1 phenotype, confirming
that nim1-5 is a nim1 allele.



2. Sequence Analysis and Subcloning of the NIM1 Region

The 9.9 kb region containing the NIM1 region was analyzed for the presence of
open reading frames in all six frames using Sequencher 3.0 and the GCG package. Four
regions containing large ORFs were identified as possible genes (Gene Regions 1-4).
These four regions were PCR amplified from DNA of the wild-type parent and six different
nim1 allelic variants. Primers for these amplifications were selected using Oligo 5.0
(National Biosciences, Inc.) and were synthesized by Integrated DNA Technologies, Inc.
PCR products were separated on 1.0% agarose gels and were purified using the QIAquick
Gel Extraction Kit. The purified genomic PCR products were directly sequenced using the
primers used for the initial amplification and with additional primers designed to sequence
across any regions not covered by the initial primers. Average coverage for these gene
regions was approximately 3.5 reads/base.

Sequences were edited and were assembled using Sequencher 3.0. Base changes
specific to various nim1 alleles were identified only in the region designated Gene Region 2.

The positions listed in Table 18 relate to Figure 14 and relate to the top strand of the
9.9 kb region featured in Figure 13. The open reading frames from the gene regions
described in Figure 13 as 1, 2, 3 and 4 were sequenced and the changes in the different
nim1 alleles are shown in the Table. The changes that are described are on the top strand,
5' to 3', as it would relate to Figure 13.

It is apparent that the NIM1 gene was cloned and that it lies within Gene Region 2,
since there are amino acid changes or alterations of sequence within the open reading
frame of Gene Region 2 in all 6 nim1 alleles. At the same time, at least one of the nim1
alleles shows no changes in the open reading frames within Gene Regions 1, 3 and 4.
Therefore, the only gene within the 9.9 kb region that could be NIM1 is Gene Region 2.

The Ws section of Table 18 indicates the changes in the Ws ecotype of Arabidopsis
relative to the Columbia ecotype of Arabidopsis. Figures 13, 14, 15 and all others wherein
sequence is shown relate to the Columbia ecotype of Arabidopsis, which contains the wild
type gene in the experiments that were conducted. The changes are listed as amino acid
changes within the gene 2 or NIM1 region and are listed as changes in base pairs in the
other regions.
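
The elimination argument above amounts to asking which gene region carries a change in
every one of the six nim1 alleles. A minimal sketch of that set intersection follows; the
dictionary summarizes, per allele, the gene regions in which Table 18 lists a change, and
the variable and function names are assumptions introduced only for this illustration.

```python
# Gene regions (1-4) in which Table 18 lists a sequence change, per allele.
CHANGES_BY_ALLELE = {
    "nim1-1": {2},
    "nim1-2": {2},
    "nim1-3": {2},
    "nim1-4": {2},
    "nim1-5": {2},
    "nim1-6": {1, 2},
}


def regions_altered_in_all(changes_by_allele):
    """Return the gene regions that are altered in every allele."""
    region_sets = list(changes_by_allele.values())
    if not region_sets:
        return set()
    return set.intersection(*region_sets)


candidates = regions_altered_in_all(CHANGES_BY_ALLELE)
print("Regions altered in all alleles:", sorted(candidates))
```

Only Gene Region 2 survives the intersection; Gene Region 1 is excluded because five of the
six alleles show no change there.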

Figure 13 shows 4 different panels that describe the cloning of the NIM1 gene and
describe the entire 9.9 kb region. Figure 14 is the sequence of the entire 9.9 kb region in
the same orientation as described in Figure 13. Figure 15 is the sequence of the specific
NIM1 gene region which is gene region 2 indicated in Figure 13; the sequence of Figure 15
contains the NIM1 gene. Figure 15 shows the amino acid sequence in single letter code
and shows the full length cDNA and RACE product that was obtained in capital letters in the
DNA sequence. Some of the allele mutations that were found are shown above the DNA
sequence and the particular nim1 allele that had that change is indicated.

Sequence analysis of the region and sequencing of various nim1 alleles (see below)
allowed identification of a cosmid region that contains the nim1 gene. This region is
delineated by a BamHI-EcoRV restriction fragment of ~5.3 kb. Cosmid DNA from D7 and
plasmid DNA from pBlueScriptII (pBSII) were digested with BamHI and with EcoRV (NEB).
The 5.3 kb fragment from D7 was isolated from agarose gels and was purified using the
QIAquick gel extraction kit (# 28796, Qiagen). The fragment was ligated overnight to the
Bam-EcoRV-digested pBSII and the ligation mixture was transformed into E. coli strain
DH5α. Colonies containing the insert were selected, DNA was isolated, and confirmation
was made by digestion with HindIII. The Bam-EcoRV fragment was then engineered into a
binary vector (pSGCG01) for transformation into Arabidopsis.



3. Northern analysis of the four gene regions.

Identical Northern blots were made from RNA samples isolated from water-, SA-,
BTH- and INA-treated Ws and nim1 lines as previously described (Delaney et al., 1995,
PNAS 92, 6602-6606). These blots were hybridized with PCR products generated from the
four gene regions identified in the 9.9 kb NIM1 gene region. Only the gene region
containing the NIM1 gene (Gene Region 2) had detectable hybridization with the RNA
samples, indicating that only the NIM1 region contains a detectable transcribed gene
(Figure 16 and Table 18).

Table 18 shows nim1 allele sequence variation.

Table 18

                Gene Region
Allele/         1                     2 (NIM1)                          3                   4
ecotype         (bases ...-1090)      (bases ...-4100)                  (bases ...-6840)    (bases 8140-9210)

nim1-1          no changes            t inserted at 2981: change of     no changes          no changes
                                      7 AA and premature
                                      termination of protein.
nim1-2          no changes            g to a at 2799: His to Tyr        no changes          no changes
nim1-3          no changes            deletion of t at 3261: change     no changes          no changes
                                      of 10 AA and premature
                                      termination of protein.
nim1-4          no changes            c to t at 2402: Arg to Lys        no changes          no changes
nim1-5          no changes            c to t at 2402: Arg to Lys        no changes          no changes
nim1-6          g to a at 734:        g to a at 2670: Gln to Stop       no changes          no changes
                Asp to Lys
WS              no changes            a to g at 1607: Ile to Leu        t to a at 5746      t to g at 8705
(compared                             a to c at 2344: intron            a to t at 5751      g to t at 8729
to                                    t to g at 2480: Gln to Pro        t to a at 5754      g to t at 8739
Columbia)                             g to c at 2894: Ser to Trp        c to t at 6728      g to t at 8784
                                      ggc deleted at 3449: lose Ala     a to t at 6815      c to a at 8789
                                      c to t at 3490: Ala to Thr        t to c at 6816      c to t at 8812
                                      c to t at 3498: Ser to Asn                            a to g at 8829
                                      a to t at 3873: non-coding                            t to g at 8856
                                      g to a at 3992: non-coding                            a to c at 9004
                                      g to a at 4026: non-coding                            a to t at 9011
                                      g to a at 4061: non-coding                            a to g at 8461
RNA detected    No                    Yes                               No                  No

Positions listed in the table relate to Figure 14 containing the 9.9 kb sequence. All alleles
nim1-1 to nim1-6 are WS strain. Columbia-0 represents the wild type.
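
Table 18 reports that a single inserted or deleted base in Gene Region 2 changes a short
run of amino acids and then terminates the protein prematurely. The sketch below shows
that frameshift effect on a made-up coding sequence; the sequence, the insertion position
and the function names are hypothetical and are not taken from Figure 14 or Figure 15.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def insert_base(dna, position, base):
    """Return dna with one base inserted before the given 0-based position."""
    return dna[:position] + base + dna[position:]


wild_type = "ATGGCACCTAAGCTTGGATAA"      # hypothetical ORF
mutant = insert_base(wild_type, 5, "T")  # hypothetical single-base insertion

print("wild type:", translate(wild_type))
print("mutant:   ", translate(mutant))
```

In this toy case the insertion leaves the first two residues intact, alters the third, and
brings a stop codon into frame immediately afterwards, so the mutant product is truncated.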



We also demonstrated that gene region 2 (Fig. 13) contains the functional NIM1
gene by doing additional complementation experiments. A BamHI/HindIII genomic DNA
fragment containing gene region 2 was isolated from cosmid D7 and was cloned into the
binary vector pSGCG01 containing the gene for kanamycin resistance (Fig. 13; Steve Goff,
personal communication). The resulting plasmid was transformed into the Agrobacterium
strain GV3101 and positive colonies were selected on kanamycin. PCR was used to verify
that the selected colony contained the plasmid. Kanamycin-sensitive nim1-1 plants were
infiltrated with these bacteria as previously described. The resulting seed was harvested and
planted on GM agar containing 50 µg/ml kanamycin. Plants surviving selection were
transferred to soil and tested for complementation. Transformed plants and control Ws and
nim1 plants were sprayed with 300 µM INA. Two days later, leaves were harvested for RNA
extraction and PR-1 expression analysis. The plants were then sprayed with Peronospora
parasitica (isolate EmWa) and grown as previously described. Ten days following fungal
infection, plants were evaluated and scored positive or negative for fungal growth. All of the
15 transformed plants, as well as the Ws controls, were negative for fungal growth following
INA treatment, while the nim1 controls were positive for fungal growth. RNA was extracted
and analyzed as described above for these transformants and controls. Ws controls and all
15 transformants showed PR-1 gene induction following INA treatment, while the nim1
controls did not show PR-1 induction by INA.

4. Isolation of a NIM1 cDNA.

An Arabidopsis cDNA library made in the λYES expression vector (Elledge et al.,
1991, PNAS 88, 1731-1735) was plated and plaque lifts were performed. Filters were
hybridized with a 32P-labeled PCR product generated from the gene region containing
NIM1. Fourteen positives were identified from a screen of approximately 150,000 plaques.
Each plaque was purified and plasmid DNA was recovered. cDNA inserts were digested out of
the vector using EcoRI, agarose-gel-purified and sequenced. Sequence obtained from the
longest cDNA is indicated in Figure 15. To confirm that we had obtained the 5' end of the
cDNA, a Gibco BRL 5' RACE kit was used following manufacturer's instructions. The
resulting RACE products were sequenced and found to include the additional bases
indicated in Figure 15. The transcribed region present in both cDNA clones and detected in
RACE is shown as capital letters in Figure 15. Changes in the alleles are shown above the
DNA strand. Capitals indicate the presence of the sequence in a cDNA clone or detected
after RACE PCR.

Example 9 
Characterization of the NIM1 gene 
The multiple sequence alignment was constructed using Clustal V (Higgins,
Desmond G. and Paul M. Sharp (1989), Fast and sensitive multiple sequence alignments
on a microcomputer, CABIOS 5:151-153) as part of the DNA* (1228 South Park Street,
Madison, Wisconsin, 53715) Lasergene Biocomputing Software package for the Macintosh
(1994).

It has been determined that certain regions of the NIM1 protein are homologous in
amino acid sequence to 4 different rice cDNA protein products. The homologies were
identified using the NIM1 sequences in a GenBank BLAST search. Comparisons of the
regions of homology in NIM1 and the rice cDNA products are shown in Figure 19. The
NIM1 protein fragments show from 36 to 48% identical amino acid sequences with the 4
rice products.
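
A percent identity figure such as those quoted above can be computed from a pairwise
alignment by counting matching positions. The sketch below is a minimal illustration and
not the BLAST or Clustal V calculation; it ignores gapped columns, which is only one of
several conventions in use, and the function name and example strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identical residues over the ungapped columns of an alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Keep only columns in which neither sequence has a gap.
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b) if a != gap and b != gap
    ]
    if not columns:
        return 0.0
    matches = sum(a == b for a, b in columns)
    return 100.0 * matches / len(columns)


# Hypothetical aligned fragments, not taken from Figure 19.
print(percent_identity("MKT-LV", "MRTALV"))
```

Dividing by the full alignment length instead of the ungapped columns would give a lower
figure, so the convention should be stated whenever identities are compared.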

Example 10 

Phenotypic characterization of the various nim1 alleles
1. Analysis of chemical responsiveness in nim1 alleles.

We analyzed the differences among the various nim1 alleles in terms of chemical
induction of PR gene expression and Peronospora parasitica resistance (see Figures 17 and
18).

Mutant plants were treated with chemical inducers and then assayed for PR gene
expression and disease resistance.



2. Plant growth and chemical application.

Wild-type seeds and seeds for each of the nim1 alleles (nim1-1, -2, -3, -4, -5, -6)
were sown onto MetroMix 300 growing media, covered with a transparent plastic dome and
placed at 4°C in the dark for 3 days. After 3 days of 4°C treatment the plants were moved
to a phytotron for 2 weeks. At approximately 2 weeks post-planting, germinated seedlings
had produced 4 true leaves. Plants were then treated with H2O, 5 mM SA, 300 µM BTH or
300 µM INA. Chemicals were applied as a fine mist to completely cover the seedlings using
a chromister. Water control plants were returned to the growing phytotron while the
chemically treated plants were held in a separate but identical phytotron. After 3 days
plants were divided into 2 groups. One group was harvested for RNA extraction and
analysis. The second group was inoculated with P. parasitica.

3. Peronospora parasitica inoculation and analysis.

P. parasitica isolate 'EmWa' is a P.p. isolate that is compatible in the Ws ecotype.
Compatible isolates are those that are capable of causing disease on a particular host. The
P. parasitica isolate 'NoCo' is incompatible on Ws but compatible on the Columbia ecotype.
Incompatible pathogens are recognized by the potential host, eliciting a host response that
prevents disease development. At 3 days post-chemical application, water and chemically
treated plants were inoculated with the compatible 'EmWa' isolate. 'NoCo' inoculation was
conducted on water treated plants only. Following inoculation, plants were covered with a
clear plastic dome to maintain the high humidity required for successful P. parasitica infection
and placed in a growing chamber with 19°C day/17°C night temperatures and 8h light/16h
dark cycles.

At various timepoints after inoculation plants were analyzed microscopically to
assess symptom development. Under magnification sporulation of the fungus can be
observed at very early stages of disease development. The percentage of plants/pot
showing sporulation at 5d, 6d, 7d, 11d and 14d after inoculation was determined and the
density of sporulation was also recorded.

Figure 18 shows the disease assessment of the various nim1 alleles following P.
parasitica inoculation. The most distinguishing timepoints are 5 and 6 days post-inoculation.
At 5 days post-inoculation nim1-4 shows ~80% infection under all inducing
chemical treatments performed, clearly indicating that this allele/genotype has the most
severe disease susceptibility. At 6 days post-inoculation, nim1-1, -2, -3, -4 and -6 show
significant disease incidence under all inducing chemical treatments. However, nim1-5
shows less infection than Ws wild-type under all treatments at day 6. Therefore, nim1-5 is
the most disease resistant of the various nim1 alleles, ... appears intermediate with
respect to disease susceptibility after BTH but not the other inducing treatments.

PR-1 gene expression indicates that nim1-4 is the least responsive to all of the
inducing chemicals tested (Figure 17), while nim1-5 shows elevated levels of PR-1
expression in the absence of inducers. These PR-1 gene expression results are consistent
with the disease assessment performed with P. parasitica (Figure 18) and indicate that nim1
alleles can cause resistance or susceptibility.

The samples obtained above were used to analyze NIM1 gene expression (Figure
17). In wildtype plants NIM1 mRNA was present in the untreated control samples.
Following treatment with SA, INA, BTH or infection with a compatible pathogen the NIM1
mRNA accumulated to higher levels. Differences in NIM1 message (mRNA) abundance
were observed in the nim1 alleles compared to wildtype. The abundance of NIM1 mRNA in
untreated mutant plants was lower than observed in the wildtype with the exception of
nim1-2 and -5 where the amounts were similar. The nim1-1, -3 and -4 had low levels of NIM1
message while the nim1-6 had very low accumulation of NIM1 mRNA. Increases in NIM1
mRNA following SA, INA or BTH were observed in nim1-1, -2, -3 but not nim1-5 or ...
However, this increase was less than observed in wildtype plants. Following pathogen
infection additional bands hybridizing to the NIM1 cDNA probe were observed in both
wildtype and mutants and the NIM1 mRNA level was elevated relative to untreated controls,
except in nim1-6.

Figure 18 shows the disease resistance assessment via infection rating of the
various nim1 alleles as well as the NahG plants at various times after inoculation with
Peronospora parasitica. WsWT indicates the Ws wild type parent line in which the nim1
alleles are found. The various nim1 alleles are indicated in the table and the NahG plant is
indicated also. The NahG plant has been previously published (Delaney et al., Science
266, pp. 1247-1250 (1994)). The NahG Arabidopsis is also described in WO 95/19443.
The NahG gene is a gene from Pseudomonas putida that converts salicylic acid to
catechol, thereby eliminating the accumulation of salicylic acid, a necessary signal
transduction component for SAR in plants. Thus, NahG Arabidopsis plants do not display
normal SAR. In addition, they show much greater susceptibility in general to pathogens.
Therefore, the NahG plants serve as a kind of universal susceptibility control. In addition,
the NahG plants still respond to the chemical inducers INA and BTH; this is shown in the
bottom two panels of Figure 17.

From Figure 18 it can be seen that the nim1-4 and nim1-6 alleles seem to be the
most severe; this is most easily observable at the earlier time points, described earlier in the
results section herein, and from the results set forth in the EmWa BTH panel, the lowest
panel, in the Figure. In addition, the nim1-5 allele shows the greatest response to both INA
and BTH and therefore it is the weakest nim1 allele.

The NahG plants show very good response to both INA and BTH and look very
similar to the nim1-5 allele. However, at late time points, Day 11 in the Figure, the disease
resistance induced in the NahG plants begins to fade, and there is a profound difference
between INA and BTH in that the INA-induced resistance fades much faster and more
severely than the resistance induced in the NahG plants by BTH. Also seen in these
experiments is that INA and BTH induced very good resistance in Ws to EmWa, and the
nim1-1, nim1-2 and other nim1 alleles show virtually no response to SA or INA in regard to
disease resistance.

Figure 18 lists the percent of plants that are showing sporulation after infection with 
the EmWa race of P. parasitica, and each of the bar graphs indicates the number of days 
after infection that the disease resistance was rated. 

Northern analysis of the expression of the SAR gene PR-1 was also
performed on the same samples, as shown in Figure 17. Figure 17 shows that the wild type
plant shows very good response to salicylate, INA, BTH and also to pathogen infection, as
manifested by enhanced PR-1 gene expression. The nim1-1 allele, on the other hand,
shows only very limited response to all the chemical inducers including pathogen.

The pathogen induction is at least several fold lower in the nim1-1 allele than it is in
the wild type. The nim1-2, nim1-3 and nim1-6 alleles show response similar to the nim1-1
allele to the various treatments. However, the nim1-4 allele shows virtually no expression in
response to any of the inducers used. Basically, background level is all that is observed.
The nim1-5 allele shows a very high background level relative to controls with water and
that background level is maintained in all the treatments; however, there is limited or no
induction by the chemical inducers.

The NahG plants serve as a good control, showing that they are unable to induce
PR-1 in the presence of SA; on the other hand, INA and BTH both induce very strong high
level expression of PR-1. The effect of pathogen infection is similar to that of SA; there is
no expression of PR-1 in the EmWa-treated NahG plants.

These same RNA samples produced in the induction studies were also probed for
the NIM1 gene using a full-length cDNA clone as probe. In Figure 16 it can be seen that INA
induces the NIM1 gene in the wild type Ws allele. However, the nim1-1 mutant allele
shows a lower basal level expression of the NIM1 gene, and it is not inducible by INA. This
is similar to what is observed in the nim1-3 allele and the nim1-6 allele. The nim1-2 allele
shows approximately normal levels in the untreated sample and shows similar induction to
that of the wild type sample, as does the nim1-4 allele. The nim1-5 allele seems to show
higher basal level expression of the NIM1 gene and much stronger expression when
induced by chemical inducers.

The induction of NIM1 by chemical inducers of resistance and other inducers is
consistent with its role in pathogen defense and is also further evidence that we have
obtained the right gene in the 9.9 kb region.

SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Novartis AG 

(B) STREET: Schwarzwaldallee 215 

(C) CITY: Basel 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE (ZIP) : 4002 

(G) TELEPHONE: +41 61 69 11 11 

(H) TELEFAX: + 41 61 696 79 76 

(I) TELEX: 962 991 



(ii) TITLE OF INVENTION: GENE CONFERRING DISEASE RESISTANCE IN 
PLANTS AND USES THEREOF 

(iii) NUMBER OF SEQUENCES: 11 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version 

(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9919 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

TGATCATGAA TTGCGTGTAG GGTTGTGTTT TAAAGATAGG GATGAGCTGA AGAAGGCGGT 60 

GGACTGGTGT TCCATTAGAG GGCAGCAAAA GTGTGTAGTA CAAGAGATTG AGAAGGACGA 120 

GTATACGTTT AAATGCATCA GATGGAAATG CAATTGGTCG CGTCGGGCAG ATTGAATAGA 180 

AGAACATGGA CTTGTTAAGA TAACTAAGTG TAGTTGGTCC ACATACTTGT TGTTCTATTA 240 

AGCCGGAAAA CTTCAACTTG TAATTTGCAG CAGAAGAGAT TGAGTGTCTG ATCAGGGTAC 300 

AACCCACTCT AACAGCAGAG TTGAAAAGTT TGGTGACATG CTTAAAACTT CAAAGCTGCG 360 

GGCAGCAGAA CAGGAA6TAA TCAAAGATCA GAGTTTCAGA GTATTGCCTA AACTAATTGG 420 

CTGCATTTCA CTCATCTAAT GGGCTACTTG TGGACTGCAA TATGAGCTTT TCCCTAATCC 480 

TGAATTTGCA TCCTTCGGTG GCGCGTTTTG GGCGTTTCCA CAGTCCATTG AAGGGTTTCA 540 

ACACTGTAGA CCTCTGATCA TAGTGGATTC AAAAGACTTG AACGGCAAGT ACCCTATGAA 600 

ATTGATGATT TCCTCAGGAC TCGACGCTGA TGATTGCTTT TTCCCGCTTG CCTTTCCGCT 660 

TACCAAAGAA GTGTCCACTG ATAGTTGGCG TTGGTTTCTC ACTAATATCA GAGAGAAGGT 720 

AACACAAAGG AAAGACGTTT GCCTCGTCTC CAGTCCTCAC CCGGACATAG TTGCTGTTAT 780 

TAACGAACCC GGATCACTGT GGCAAGAACC TTGGGTCTAT CACAGGTTCT GTCTGGATTG 840 

TTTTTGCTTA CAATTCCATG ATATTTTTGG AGACTACAAC CTGGTGAGCC TTGTGAAGCA 900 

GGCTGGATCC ACAAGTCAGA AGGAAGAATT TGATTCCTAC ATAAAGGACA TCAAAAAGAA 960 

GGACTCAGAA GCTCGGAAAT GGTTAGCCCA ATTCCCTCAA AATCAGTGGG CTCTGGCTCA 1020 

TGACCAGTGG TCGGAGATAT GGAGTCATGA CGATAGAAAC AGAAGATTTG AGGGCAATTT 1080 

GTGAAAGCTT TCAGTCTCTT GGTCTATCAG TGACAGCGAA CGCACCTGCA CATGTGGGAA 1140 
GTTTCAATCG AAGAAGTTTC CATGTATGCA CCCAGAAATG GTGCAAAGGA TTGTTAACTT 1200 
GTGTCATTCA CAAATGTTGG ATGCAATGGA GCTGACTAGG AGAATGCACC TTACACGCCC 1260 
ACTCAGTGTT CTCTTATCTC TAGACCTGAA ACTAACTTGC TCTCTAATTC GAGTTACAAA 1320 
AGGTTAAAGG AAGAATTAGG AAGATACATA TAACATGAAT GTTGCCAGAA GTTCAGGGAA 1380 
CTTGAATATT CTTTTGGTTC TTCGTGGAAA ATATCCAACA GATCAACAAT TTGACATTAT 1440 
TTCACACTTT GATTCTAGCA ACTCTGTAAC ACCATCATGG GTTATTCtTC ATGTACATAA 1500 
ATATATATTA CAAATCTCTA TACCATTGGT TCAAATTGTT ACAACATTTG TTTGAAGCAC 1560 
ACCTGCAGCA ATAATACACA GGATGCAAAA CGAAGAGCGA AACTATATGA CGCCAACGAT 1620 
AGACATAAAC AGTTACAGTC ATCATGAAAA CAGAATTATA TGGTACAGCA AAAATTACAC 1680 
TAAGAGGCAA GAGTCTCACC GACGACGATG AGAGAGTTTA CGGTTAGACC TCTTTCCACC 1740 
GGTTGATTTC GATGTGGAAG AAGTCGAATC TGTCAGGGAC GAATTTCCTA ATTCCAAATT 1800 
GTCCTCACTA AAGGCCTTCT TTAGTGTCTC TTGTATTTCC ATGTACCTTT GCTTCTTTTG 1860 
TAGTCGTTTC TCAGCAGTGT CGTCTTCTCC GCAAGCCAGT TGAGTCAAGT CCTCACAGTT 1920 
CATAATCTGG TCGAGCACTG CCGAACAGCG CGGGAAGAAT CGTTTCCCGA GTTCCACTGA 1980 
TGATAAAAAA AACAAGGTCA GACAGCAAGT AACAAAACCA TGTTTAAAGA TCATTTAGTT 2040 
TTGTTTTTTG TGATAAGGAG TCCGATGAAG TGGGTGAGAA TCCATACCGG TTTTAGAAAG 2100 
CGCTTTTAGT CTACTTTGAT GCTCTTCTAG GATTCTGAAA GGTGCTATCT TTACACCCGG 2160 
TGATGTTCTC TTCGTACCAG TGAGACGGTC AGGCTCGAGG CTAGTCACTA TGAACTCACA 2220 
TGTTCCCTTC ATTTCGGCGA TCTCCATTCC AGCTTGTCCT TCCGTTCGAA AAAGACGTTG 2280 
AGCAAGTGCA ACTAAACAGT GGACGACACA AAGAATAGTT ATCATTAGTT CACTCAGTTT 2340 
CCTAATAGAG AGGACATAAA TTTAATTCAA ACATATAAGA AATAAGACTT GATAGATACC 2400 
TCTATTTTCA AGATCGAGCA GCGTCATCTT CAATTCATCG GCCGCCACTG CAAAAGAGGG 2460 
AGGAACATCT CTAGGAATTT GTTCTCGTTT GTCTTCTTGC TCTAGTATTT CTACACATAG 2520 

TCGGCCTTTG AGAGAATGCT TGCATTGCTC CGGGATATTA TTACATTCAA CCGCCATAGT 2580 

GGCTTGTTTT GCGATCATGA GTGCGGTTCT ACCTTCCAAA GTTGCTTCTG ATGCACTTGC 2640 

ACCTTTTTCC AATAGAGATA GTATCAATTG TGGCTCCTTC CGCATCGCAG CAACATGAAG 2700 

CACCGTATAT CCCCTCGGAT TCCTATGGTT GACATCGGCA AGATCAAGTT TTAAAAGATC 2760 

TGTTGCGGTC TTCACATTGC AATATGCAAC AGCGAAATGA AGAGCACACG CATCATCTAG 2820 

ATTGGTGTGA TCCTCTTTCA AAAGCAACTT GACTAACTCA ATATCATCCG AGTCAAGTGC 2880 

CTTATGTACA TTCGAGACAT GTTTCTTTAC TTTAGGTACC TCCAAACCAA GCTCTTTACG 2940 

TCTATCAATT ATCTCTTTAA CAAGCTCTTC CGGCAATGAC TTTTCAAGAC TAACCATATC 3000 

TACATTAGAC TTGACAATAA TCTCTTTACA TCTATCCAAT AGCTTCATAC AAGCTTTACC 3060 

ACATATATTA GCAAGCTTGA GTATAACCAA TGTGTCCTCT ATAACAACTT TGTCTACAAC 3120 

GTCCAATAAG TGCCTCTGAA ATACAAATAC AAGTACTCAA GTAAGAACAT ATTCATGAAT 3180 

GTGTAACCAT AGCTTAATGC AGATGGTGTT TTACCTCATA GAGAGTAATT AATTCAGGGA 3240 

TCTTGAAGAT GAAAGCCAAA TAGAGAACCT CCAACATGAA ATCCACCGCC GGCCGGCAAG 3300 

CCACGTGGCA GCAATTCTCG TCTGCGCATT CAGAAACTCC TTTAGGCGGC GGTCTCACTC 3360 

TGCTGCTGTA AACATAAGCC AAAACAGTCA CAACCGAATC GAAACCGACT TCGTAATCCT 3420 

TGGCAATCTC CTTAAGCTCG AGCTTCACGG CGGCGGTGTT GTTGGAGTCT TTCTCCTTCT 3480 

TAGCGGCGGC TAAAGCGCTC TTGAAGAAAG AGCTTCTCGC TGACAAAACG CACCGGTGGA 3540 

AAGAAACTTC CCGGCCGTCG GAGAGAACAA GCTTAGCGTC GCTGTAGAAA TCATCCGGCG 3600 

AGTCAAAGAC GGATTCGAAG CTGTTGGAGA GCAATTGCAG AGCAGATACA TCAGGTCCGG 3660 

TGAGTACTTG TTCGGCGGCC AGATAAACAA TAGAGGAGTC GGTGTTATCG GTAGCGACGA 3720 
AACTAGTGCT GCTGATTTCA TAAGAATCGG CGAATCCATC AATGGTGGTG TCCATCAACA 
GGTTCCGATG AATTGAAATT CACAAATTAA AGACATCTCT GCTAATCAAC GAAGAGACCT 
TATCAACTGG ATTO^GTTAA AGATCGAAGA TAACCATTGA CGAGCAGAGC CAAGTCAAGT 
CAACGAGAGT GGTGGTGAGA TATGAAGAAG CATCCTCGTC CCACGGTrPA CATTTCACCA 
AAACCGGTAA ATTTCCAGGA AAGGAATCTT TGTCAGAGAT CTTTTTTAAA AAGATATAAC 
AGGAAGCTAA ACCGGTTCGG GTTATAAATG TTAGTATTTA TACCGGAGAC ATTrnSTCTT 
GCTAATTTrr GTATATGAGA AGrrCAAICC GGn^GGTAA GCCCCTGAAC CAAACTAGAT 
TTGGAGATGA TATAAATATA TAAAATTTAT TTTTCATCCG GTTCGTTATT TTCATATAAA 
TATATAAATA TTATTTTTTA AATTTAAGAA TTAGATTTAC AT.5TGAAAGT TACATTTCTC 

TTTArrrrcT ttgaagtaaa atgataaagg gaacgtatat taagtttcat gctttattca 

CATAAGTTTT GTAATGTATA TTATATTTTT CGTTTATTGA AAAAGTAATT TTCAGTCTTC 
AGCATGTTTA CACTATAATT AAATCAAGTC GAATATTTCC TGGAACTATT CTCCTTGTTC 
TATAGCAAAT GAAAACGCTC TTCACAACAA AATCArTATA GATATAGGAA TAAATTACAT 
TAAAAACATG AAAGTCATAA TGAATATATT TTTTTAATTA GGArrTCATT TAAAAACAAT 
TATTGTATAC ATATAAAAGA CTTCTTTAGT TATTTGCCTT CAACTTCTCG TTCTGAATCA 
I^CGATAAAT CAGCTTXrrC AATAACTACG ACGTAAAAGC AAATTCATAA CACGTCTAAA 
CAAATTTGGC TCATCCTTCA CTTGATTGGT GTTTTCCGGA CTCGATGTTO CTCGAAACTG 
AGAAGAAGAA GGAATCTGCA TAATCACCTC TTGGTTCCTC ACCGGTAGAC TCATTTTGTT 
GGATCGAAAA CGATCGAGAT CAGAAAATGA AAAGATAGGT TAAAGATGCC TATGAATACA 
ACAACGTAAG ATTATGTl^A ATAAACAGAG TACTTTATAT AGGAGTTATA ATAAGGTAAA 
TAAATTAT^ CTITCCGCGT TTTrTACTTT TGTATrTCTT AAATGATAAG TTAAATTAGG 
ATAAGATTTG TATGATTTTA AGTAAATTTA CAATAACTCT CTATAACTCA ATAGCATCAC 
ATATTTAATT AATTTTACTA ATTATCTTTT GAACAATTTT ATGAAATAGT TTTCTTTTAA 
TTAATTTTTT AAAATGATAT ATTATAAAAT TTAATTGAAT CAATCTCATA TAATTTTTTT 
ATCTTCTACC ATCTATTATA GTTGATAAAT ATTGTGATAA ACTTTAGATA AACACCCAAT 
TGCCAAATAT TTAATAAATT TTGTGTACCA TGCGTTTTTT TTGGAGAATA TATATACGTG 
GACAGCATAC CGTACATATA TTGTATAAAA GCTTATAAAA CATAGATACG GGTTATATTG 
GTAAGCTATA AATATATGTA AACAATAGTA AGATATTACG TGTTGTGTCT AAATATGTGT 
TGCTTTAGAT ATTATGTATA TCTAATATAT TAAAATATCT TTTATTAACT AATATATTAT 
TTAAGAGAGA AAATTGGGAC ACTATTTTCT ATACAGTAAC TGTTTTCAAC TATAAACAGG 
AACCCTTGAT ATAATAAAAT AACTAGCCAA AAAATCAGAT TAAATATTCA TAAAACAATG 
TTTGGTATTA TTACATAAAC CTAAGAAACA AAATTCAATA TTCCTTTTTA GCTTATAAAA 
AACAATTAAA CATCACTAGA TATATTTATG CCCCACAATG AGCGAGCCAA TTGAGACTTG 
AGACTTGAGA TCCTTGTCAA CTACGTTTGC ATTTGTCGGC CCATTTTTTT TATTTTTTTT 
TTAAAGTGTC GGCCCGTTGC TTCTTCCGTT CAGATCAACC CTCTCGTAAT CAGAACAAAA 
CGGAAAACAA ACGAAAGAAC AATCAGATCC CTCTTTTTTT GCATAAACTA AATTCAACTT 
CTCTGCGTTT ATGTTGTAGA GGCAACCACG ATCACTACTA CGAAACAATA CAACGTCGTT 
GCTTGGAGTC CACGTAATCA AATCTACTCC AATGCTTTTA ATATCTTTCA CTTTAACCCA 
CGACTTTTCA AAACTGCTCT TTAAAACCCA TAACTCGTGA ACATCTTCTT GATCTTTGTT 
TGTCCACTGA CGAATAGCAC CTAGCTTCCC TTCGTATCTG ACTAATCCTG AGAAAACATC 
AGAGTTCGGA GTATGGAAGA AGGACCAAGT TTCGGTTTTG AGACAAAACC GGATCACATT 
GrrCTTCCGT GATATCCAAT GCAAGAACCC CGAAACTTGT ATCGGGTTGG AAAAAATTAA 
TCTGTCTGTT TTTGGTAGAC GCAAATTTTC TAATCTCTTC CAGGTAAACG A^^.TCAGAATC 
<=«a«:ttcg CACATAA^G TTCTGTOATT CAAATCGTAC ATACCCCGAC ACATACACAT 6360 
ACGCCGAOAC TccGAAAa:c ™,=ta™t atacc«»aa ogottcaa^: COATTACCCC 

TAAACCCAAT OACATATCCC AACCCTTCAC TTCTOOCTTT OGTAT^ACCT OATACT^TTT „80 

AG^STTCOT TTGAAOACTA TCTAK:CACG ^ATGGTTTT GTATAC^AA CACAAAGCAA „40 

TATCCCA«» C™=CA^C AAG^AT CTTTA^T. CCGGGTCCCA CAAAOTCGAT ,eOO 

GGAOACTCCA TTG^A AA^AC.CCT C«A^CA AAAC^ CAAGTKCTO 66.0 

^GTTTTACT ATGTAGTGTT GTA^GTA ^CGAAATA CGATTGGTTC TAAGGAGA^ 6730 

AAGAT^CA AACCA^CT CGTAGCT^ c^^,, TCTTTATTCA GGAGCCT«AA 6,80 

^CGATT m«ACGCCG GAAOAPAAGA AAGAAATTCT ^GATCA^T CTTGAT^AT 6640 

CACCGGAGAA CTCATGATCC TO^GGGAAT AAAGAGA^. GCACOATCAC «=AATGAGAA 6,00 

ATGAAAAAAT CA«3ATC=GT AGAGAACAAC TTA^^ ^^^^ 

TC^GTTTA AGGAAAGTAT CAAAATTTGC CTTTTTCTTC GCTAGTCCXA AAACAAACAA 7020 
ATTAACGAAA AGATAAAATC TTTCATCATT AATCTTACTT GTGATACCTT AAGCCAAAAC 7080 
t™«:ttta GACTTTTAAC CAAATCTACA GTAATTTAAT TGCTAGACTT AGGAAACAAC 7140 
"tt™t ACCCAACAAT CTTTGGATTT TAATTCTTTT TTTTTCTACT AATAGATTAA 7200 
CAACTCATTA TATAATAATG TTOTATCAT AA^ACAAT TCrTTCTO^ TAATAAACAX 7260 
CCAGC^TA TAATAATCCA CAAGTCAATT TCACCATTTT G=CCAA™.A TTrrCTTATA 7320 
MAATTAGCA CAAAAAAGAT TAT^AWGTT TAGCAGA^rT AATTTCTAAT TAACTTACGT 7380 
AAT^GCArr TTCCATAGAT TTA«:T^. ^Am^c TTAG^ATCT TAGTAc™ 7440 
TTAGTTTccr tagtaa™ AAATTTTAAO ATAATATATT GAAATTAAAA GAAGAAAAAA 7500 
AACTCTACTT ATACTTTTCT TAAATGTTTC ATCACACTAA CTAATAATTT TTTTTAGTTA 7560 
AATTACAATA TATAAACACT GAAGAAAGTT rrTCGCCCAC ACTTTTTTCG GATCAATTAG 7620 
TACTATAGTT AGGGGAAGAT TCTGATTTAA AGGATACCAA AAATGACTAG TTAGGACATG 7680 
AATGAAAACT TATAATCTCA ATAACATACA TACGTGTTAC TGAACAATAG TAACATCTTA 7740 
CGTGTTTTGT CCATATATTT GTTGCTTATA AATATATTCA TATAACAATG TTTGCATTAA 7800 
GCTTTTAAGA AGCACAAAAC CATATAACAA AATTAAATAT TCCTATCCCT ACCAAAAAAA 7860 
AAAATTAAAT ATTCCTACAG CCTTGTTGAT TATTTTATGC CCTACGTTGA GCCTTGTTGA 7920 
CTAGTTTGCA TTTGTCGGTC CATTTCTTCT TCCGTCCAGA TCAACCCTCT CGTAATCAGA 7980 
ACAAAAGGGG AAACAAACGT AAGAGGCAAA ATCCTTGTTT GTATGAACTA AGTTTAACTT 8040 
CTCTGTGTTT AAGTTGTAGA GGCAAACATG ATCCCAACTA GAAAGCATTA CGACGTCGTT 8100 
GCTTGGTATC CACGTAATAT GCTCTACTCC AATGCTTTCA ATATCTTTCA CTTTTTCCCA 8160 
CGACTTTTCA AAACTGCTCT TTAAAACCCA TAATCTGTGA ACATCTTCTT GATTGTTCTT 8220 
TATCCAGTGA CGAATAACAC CTAGCTTCCC TTCGTAGCTG ACTAACTCTG GGAATAAACC 8280 
AACGTTTGGA GTATGTAAGA AAGACCAAGT TTCGGTTTTG GGACATAACC GGATCACATT 8340 
GTGGTTCCAT GATCTCCAAT GCAAGAACCC TGAAGCTTGT ACCGGGTTTG AAAGAATTAG 8400 
ACCGTCTGTT CTCGGTAGAC GCAAATTTTT TAATCTCTTC CACATAAACG AATCGGAATC 8460 
AAAAACTTCG CACGCAAAAG TTCTGAGATT CCGAGTCATA CCAGGCGATT TCGAAAGCCT 8520 
AAATATTTTA TACCGGAAAG GCTGCAATCC GGTTACCGTT AGACCTAATG ACTTATCACA 8580 
ACTCCTCACT TTTGGGTTTG GTATGATCTG ATACTGTTTT GTTGTTGGTT TGCAGACTAT 8640 
GTATTCCGGT ATTGGTCTTG TATCATTATA ACAAAGCAAT ATCCCATGAC GTGCATCACA 8700 
AGCTTTGATC TTTACCTCTC CTTGTGGCAG AAAATCGATG GAGACTCCTT TGTTATCCAA 8760 
ATCTCTCCTC TCATGGAAAA AACTGGTATC AAGTTTGTAT CCTCTTTCGT AGCGTTCTAG 8820 
GAAGTATCCA GAGATATTGT TGGTTCGATG GAGATTTAGG TTGACAAACC AAGACTCGTA 8880 
ce^™ ™c«rc.T ™™„«0 CCTCMTTTT cco.r^^ »ccccco«o 8940 
ATMC^AOA ^CCTCT^;* TCO«=T^ .™„^ec ^TCT«„ ,„oo 

GGMAAAAOA AA=»«<^OA TOAO^ACOAT CAO^^ATOA GATATATAOA AATCAGGATT 
«TA«CAAC COACOATOAT GAATATACAA O^^TTTATAA OTATCACAAA TTOCCTOTT 
C^TTCOCTACT CCCAAAACAA OCAAA^AAC CAAAOATAAA ATCT«a™ a^.ttTCC. 
TTTTC^ CA0TCCCA3A TAAAAATATA TATAAAATAT T^A^^OT TACT^aOT 
Accrue CAAACTTTCT CTTT^^AC^ „AACCAAA. TAACA^TAAT OTAATASCA 
OACT.AOAAA ACAACATTTT GTATATATAT ^TTTOACAT CAAAATTCAA CAATCTTTOO 
OTTTCTATA. «=TTTTTTTT CTTAT.C.AA TAOATTACCA C^ATTATAT CATATACAAA 
OTCTTTCCTT TTCAATCAAC A«:CATTTTC TTTAAAAATT AGCAAOTTTO TTCrTATATC 
.T^AT^A^ AOATTTCTTA ATTAAACTTA O^^CA TTT^CACCT ATA,,^ 

™act ttaotact^ aaa™.ax atatataa™ TA^AAAA^ AAAACTAAAA 

ACTCCAGTTT AACTTATGTT AAATCTTTCA TCACACTAAA aoi.^^. 

ICACACTAAA AGAGCATTAA GTAATAAATA 

™<=C™ AO^AAAAAAA ArATCAAATC ACTGAA^„ 

OCCAATTAOT AATAOACTAA TAOTAACTCA ta^ata^. CTCTAA,^. 
OOCOAAACCA ATATT^OA. TCTAAA<=ATA OTAAAAATOA AT™a«» a<^AATACT ,e40 
™aca cctacaaaca otaacotaca aacc™.„ ^^^^ 

TCAAGAAGTT CTCATCGAT 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5655 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 2787..3347 

(D) OTHER INFORMATION: /product= "1st exon of NIM1" 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 3427..4162 

(D) OTHER INFORMATION: /product= "2nd exon of NIM1" 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 4271..4474 

(D) OTHER INFORMATION: /product= "3rd exon of NIM1" 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 4586..4866 

(D) OTHER INFORMATION: /product= "4th exon of NIM1" 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join(2787..3347, 3427..4162, 4271..4474, 
4586..4866) 
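
The exon coordinates in the FEATURE entries for SEQ ID NO:2 can be checked arithmetically. The following is a minimal sketch, not part of the original listing, that assumes the four exon ranges are 1-based and inclusive; it sums the exon lengths and tests whether the joined coding sequence is a whole number of codons. The names `EXONS` and `cds_length` are illustrative only.

```python
# Exon coordinates (1-based, inclusive) as given in the FEATURE entries of SEQ ID NO:2.
EXONS = [(2787, 3347), (3427, 4162), (4271, 4474), (4586, 4866)]


def cds_length(exons):
    """Return the total number of nucleotides covered by the joined exons."""
    return sum(end - start + 1 for start, end in exons)


total_nt = cds_length(EXONS)
codons, remainder = divmod(total_nt, 3)

print(total_nt)   # 1782
print(codons)     # 594
print(remainder)  # 0
```

Under these assumptions the join spans 1782 nucleotides, that is 594 codons with no remainder, which agrees with a coding sequence whose final codon is the terminator marked `*` at the end of SEQ ID NO:3.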



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

TGTGATGCAA GTCATGGGAT ATTGCTTTGT GTTAAGTATA CAAAACCATC ACGTGGATAC 60 

ATAGTCTTCA AACCAACCAC TAAACAGTAT CAGGTCATAC CAAAGCCAGA AGTGAAGGGT 120 

TGGGATATGT CATTGGGTTT AGCGGTAATC GGATTGAACC CTTTCCGGTA TAAAATACAA 180 

AGGCTTTCGC AGTCTCGGCG TATGTGTATG TCTCGGGGTA TCTACCATTT GAATCACAGA 240 

ACTTTTATGT GCGAAGTTTT CGATTCTGAT TCCTTTACCT GGAAGAGATT AGAAAATTTC 300 

CGTCTACCAA AAACAGACAG ATTAATTTTT TCCAACCCGA TACAAGTTTC GGGGTTCTTG 360 

CATTGGATAT CACGGAACAA CAATGTGATC CGGrmGTC TCAAAACCGA AACTTGGTCC 420 

rrcrrCCATA CTCCGAACTC TGATGTTTTC TCAGGATTAG TCAGATACGA AGGGAAGCTA 480 

GGTGCTATTC GTCAGTGGAC AAACAAAGAT CAAGAAGAI^ T«:ACGAGTT A^TTTTA 540 

AAGAGCAGTT TTGAAAAGTC GTGGGTTAAA GT^AAAGATA TPAAAAGCAT T^GTAGAT 600 

I^GATTACGT GGACTCCAAG CAACGACGTT GTATTGTTTC GTAGTAGTGA TCGTGGTTOC 660 

CTCTACAACA TAAACGCAGA GAAGTTGAAT TTAGT^TATG CAAAAAAAGA GGGAT^^AT 720 

TGTTCrrrCG TTTGTTTTCC CTTTTGTTCT GATTACGAGA GGGTTCAO^T GAACGGAAGA 780 

AGCAACGGGC CGACACTrTA AAAAAAAAAT AAAAAAAATG GGCCGACAAA TGCAAACGTA 840 

GTTGACAAGG ATCTCAAGTC TCAAGTCTCA Al^^Gc TCAT^CGG GCATAAATAT 900 

ATCTAGTGAT GTTTAATTGT TTTTTATAAG GTAAAAAGGA ATATTGAATT rTGTrrCrTA 960 

GGTTTAIX.TA ATAATACCAA ACAT^™ A^AATAn^ AATC^A^ TTO^GCTAGT 1020 

TATTTTATTA TATCAAGGGT TCCTGTITAT AGTTGAAAAC AGrTACTGTA TAGAAAATAG 1080 

TG^CCAATT ri^C^O^A AATAATATAT TAGTT^,^ ^OATATTTT AATATAT.AG 1140 

ATATACATAA TATCTAAAGC AACACATATT TAGACACAAC ACGTAATATC TTACTAOTGT 1200 

rrACATATAT TTATAGCTTA CCAATATAAC CCGTATCTAT GT.TTATAAG CTTTTATACA 1260 

ATATATGTAC GGTATGCTGT CCACGTATAT ATATTCO^CA AAAAAAACGC ATGGTACACA 1320 

AAArrTATTA AATAT^C AArTGGGTGT rTATCTAAAG T^TATCACAA TATTTATCAA 1380 

CTATAATAGA TGGTAGAAGA TAAAAAAATT ATATCAGATT GATTCAATTA AATTTTATAA 1440 

TATATCArrr TAAAAAATTA ATTAAAAGAA AACTArrxCA TAAAATTGTT CAAAAGATAA 1500 

TTAGTAAAAT TAATTAAATA TGTGATGCTA TTGAGTTATA GAGAGTTATT GTAAATTTAC 1560 

TTAAAATCAT ACAAATCTTA TCCTAATTTA ACTTATCATT TAAGAAATAC AAAAGTAAAA 1620 

AACGCGGAAA GCAATAATTT ATTTACCTTA TTATAACTCC TATATAAAGT ACTCTGTTTA 1680 

TTCAACATAA TCTTACGTTG TTGTATTCAT AGGCATCTTT AACCTATCTT TTCATTTTCT 1740 

GATCTCGATC GTTTTCGATC CAACAAAATG AGTCTACCGG TGAGGAACCA AGAGGTGATT 1800 

ATGCAGATTC CTTCTTCTTC TCAGTTTCCA GCAACATCGA GTCCGGAAAA CACCAATCAA 1860 

GTGAAGGATG AGCCAAATTT GTTTAGACGT GTTATGAATT TGCTTTTACG TCGTAGTTAT 1920 

TGAAAAAGCT GATTTATCGC ATGATTCAGA ACGAGAAGTT GAAGGCAAAT AACTAAAGAA 1980 

GTCTTTTATA TGTATACAAT AATTGTTTTT AAATCAAATC CTAATTAAAA AAATATATTC 2040 

ATTATGACTT TCATGTTTTT AATGTAATTT ATTCCTATAT CTATAATGAT TTTGTTGTGA 2100 

AGAGCGTTTT CATTTGCTAT AGAACAAGGA GAATAGTTCC AGGAAATATT CGACTTGATT 2160 

TAATTATAGT GTAAACATGC TGAACACTGA AAATTACTTT TTCAATAAAC GAAAAATATA 2220 

ATATACATTA CAAAACTTAT GTGAATAAAG CATGAAACTT AATATACGTT CCCTTTATCA 2280 

TTTTACTTCA AAGAAAATAA ACAGAAATGT AACTTTCACA TGTAAATCTA ATTCTTAAAT 2340 

TTAAAAAATA ATATTTATAT ATTTATATGA AAATAACGAA CCGGATGAAA AATAAATTTT 2400 

ATATATTTAT ATCATCTCCA AATCTAGTTT GGTTCAGGGG CTTACCGAAC CGGATTGAAC 2460 

TTCTCATATA CAAAAATTAG CAACACAAAA TGTCTCCGGT ATAAATACTA ACATTTATAA 2520 

CCCGAACCGG TTTAGCTTCC TGTTATATCT TTTTAAAAAA GATCTCTGAC AAAGATTCCT 2580 

TTCCTGGAAA TTTACCGGTT TTGGTGAAAT GTAAACCGTG GGACGAGGAT GCTTCTTCAT 2640 

ATCTCACCAC CACTCTCGTT GACTTGACTT GGCTCTGCTC GTCAATGGTT ATCTTCGATC 2700 

TTTAACCAAA TCCAGTTGAT AAGGTCTCTT CGTTGATTAG CAGAGATCTC TTTAATTTGT 2760 

GAATTTCAAT TCATCGGAAC CTGTTG ATG GAC ACC ACC ATT GAT GGA TTC GCC 2813 
Met Asp Thr Thr Ile Asp Gly Phe Ala 
1 5 

GAT TCT TAT GAA ATC AGC AGC ACT AGT TTC GTC GCT ACC GAT AAC ACC 2861 
Asp Ser Tyr Glu Ile Ser Ser Thr Ser Phe Val Ala Thr Asp Asn Thr 
10 15 20 25 

GAC TCC TCT ATT GTT TAT CTG GCC GCC GAA CAA GTA CTC ACC GGA CCT 2909 
Asp Ser Ser Ile Val Tyr Leu Ala Ala Glu Gln Val Leu Thr Gly Pro 
30 35 40 

GAT GTA TCT GCT CTG CAA TTG CTC TCC AAC AGC TTC GAA TCC GTC TTT 2957 
Asp Val Ser Ala Leu Gln Leu Leu Ser Asn Ser Phe Glu Ser Val Phe 
45 50 55 

GAC TCG CCG GAT GAT TTC TAC AGC GAC GCT AAG CTT GTT CTC TCC GAC 3005 
Asp Ser Pro Asp Asp Phe Tyr Ser Asp Ala Lys Leu Val Leu Ser Asp 
60 65 70 

GGC CGG GAA GTT TCT TTC CAC CGG TGC GTT TTG TCA GCG AGA AGC TCT 3053 
Gly Arg Glu Val Ser Phe His Arg Cys Val Leu Ser Ala Arg Ser Ser 
75 80 85 

TTC TTC AAG AGC GCT TTA GCC GCC GCT AAG AAG GAG AAA GAC TCC AAC 3101 
Phe Phe Lys Ser Ala Leu Ala Ala Ala Lys Lys Glu Lys Asp Ser Asn 
90 95 100 105 

AAC ACC GCC GCC GTG AAG CTC GAG CTT AAG GAG ATT GCC AAG GAT TAC 3149 
Asn Thr Ala Ala Val Lys Leu Glu Leu Lys Glu Ile Ala Lys Asp Tyr 
110 115 120 

GAA GTC GGT TTC GAT TCG GTT GTG ACT GTT TTG GCT TAT GTT TAC AGC 3197 
Glu Val Gly Phe Asp Ser Val Val Thr Val Leu Ala Tyr Val Tyr Ser 
125 130 135 

AGC AGA GTG AGA CCG CCG CCT AAA GGA GTT TCT GAA TGC GCA GAC GAG 3245 
Ser Arg Val Arg Pro Pro Pro Lys Gly Val Ser Glu Cys Ala Asp Glu 
140 145 150 

AAT TGC TGC CAC GTG GCT TGC CGG CCG GCG GTG GAT TTC ATG TTG GAG 3293 
Asn Cys Cys His Val Ala Cys Arg Pro Ala Val Asp Phe Met Leu Glu 
155 160 165 

GTT CTC TAT TTG GCT TTC ATC TTC AAG ATC CCT GAA TTA ATT ACT CTC 3341 
Val Leu Tyr Leu Ala Phe Ile Phe Lys Ile Pro Glu Leu Ile Thr Leu 
170 175 180 185 

TAT CAG GTAAAACACC ATCTGCATTA AGCTATGGTT ACACATTCAT GAATATGTTC 3397 
Tyr Gln 

TTACTTGAGT ACTTGTATTT GTATTTCAG AGG CAC TTA TTG GAC GTT GTA GAC 3450 
Arg His Leu Leu Asp Val Val Asp 
190 195 

AAA GTT GTT ATA GAG GAC ACA TTG GTT ATA CTC AAG CTT GCT AAT ATA 3498 
Lys Val Val Ile Glu Asp Thr Leu Val Ile Leu Lys Leu Ala Asn Ile 
200 205 210 

TGT GGT AAA GCT TGT ATG AAG CTA TTG GAT AGA TGT AAA GAG ATT ATT 3546 
Cys Gly Lys Ala Cys Met Lys Leu Leu Asp Arg Cys Lys Glu Ile Ile 
215 220 225 

GTC AAG TCT AAT GTA GAT ATG GTT AGT CTT GAA AAG TCA TTG CCG GAA 3594 
Val Lys Ser Asn Val Asp Met Val Ser Leu Glu Lys Ser Leu Pro Glu 
230 235 240 

GAG CTT GTT AAA GAG ATA ATT GAT AGA CGT AAA GAG CTT GGT TTG GAG 3642 
Glu Leu Val Lys Glu Ile Ile Asp Arg Arg Lys Glu Leu Gly Leu Glu 
245 250 255 

GTA CCT AAA GTA AAG AAA CAT GTC TCG AAT GTA CAT AAG GCA CTT GAC 3690 
Val Pro Lys Val Lys Lys His Val Ser Asn Val His Lys Ala Leu Asp 
260 265 270 275 

TCG GAT GAT ATT GAG TTA GTC AAG TTG CTT TTG AAA GAG GAT CAC ACC 3738 
Ser Asp Asp Ile Glu Leu Val Lys Leu Leu Leu Lys Glu Asp His Thr 
280 285 290 

AAT CTA GAT GAT GCG TGT GCT CTT CAT TTC GCT GTT GCA TAT TGC AAT 3786 
Asn Leu Asp Asp Ala Cys Ala Leu His Phe Ala Val Ala Tyr Cys Asn 
295 300 305 

GTG AAG ACC GCA ACA GAT CTT TTA AAA CTT GAT CTT GCC GAT GTC AAC 3834 
Val Lys Thr Ala Thr Asp Leu Leu Lys Leu Asp Leu Ala Asp Val Asn 
310 315 320 






CAT AGG AAT CCG AGG GGA TAT ACG GTG CTT CAT GTT GCT GCG ATG CGG 3882 
His Arg Asn Pro Arg Gly Tyr Thr Val Leu His Val Ala Ala Met Arg 
325 330 335 

AAG GAG CCA CAA TTG ATA CTA TCT CTA TTG GAA AAA GGT GCA AGT GCA 3930 
Lys Glu Pro Gln Leu Ile Leu Ser Leu Leu Glu Lys Gly Ala Ser Ala 
340 345 350 355 

TCA GAA GCA ACT TTG GAA GGT AGA ACC GCA CTC ATG ATC GCA AAA CAA 3978 
Ser Glu Ala Thr Leu Glu Gly Arg Thr Ala Leu Met Ile Ala Lys Gln 
360 365 370 

GCC ACT ATG GCG GTT GAA TGT AAT AAT ATC CCG GAG CAA TGC AAG CAT 4026 
Ala Thr Met Ala Val Glu Cys Asn Asn Ile Pro Glu Gln Cys Lys His 
375 380 385 

TCT CTC AAA GGC CGA CTA TGT GTA GAA ATA CTA GAG CAA GAA GAC AAA 4074 
Ser Leu Lys Gly Arg Leu Cys Val Glu Ile Leu Glu Gln Glu Asp Lys 
390 395 400 

CGA GAA CAA ATT CCT AGA GAT GTT CCT CCC TCT TTT GCA GTG GCG GCC 4122 
Arg Glu Gln Ile Pro Arg Asp Val Pro Pro Ser Phe Ala Val Ala Ala 
405 410 415 

GAT GAA TTG AAG ATG ACG CTG CTC GAT CTT GAA AAT AGA G 4162 
Asp Glu Leu Lys Met Thr Leu Leu Asp Leu Glu Asn Arg 
420 425 430 

GTATCTATCA AGTCTTATTT CTTATATGTT TGAATTAAAT TTATGTCCTC TCTATTAGGA 4222 

AACTGAGTGA ACTAATGATA ACTATTCTTT GTGTCGTCCA CTGTTTAG TT GCA CTT 4278 
Val Ala Leu 
435 

GCT CAA CGT CTT TTT CCA ACG GAA GCA CAA GCT GCA ATG GAG ATC GCC 4326 
Ala Gln Arg Leu Phe Pro Thr Glu Ala Gln Ala Ala Met Glu Ile Ala 
440 445 450 

GAA ATG AAG GGA ACA TGT GAG TTC ATA GTG ACT AGC CTC GAG CCT GAC 4374 
Glu Met Lys Gly Thr Cys Glu Phe Ile Val Thr Ser Leu Glu Pro Asp 
455 460 465 

CGT CTC ACT GGT ACG AAG AGA ACA TCA CCG GGT GTA AAG ATA GCA CCT 4422 
Arg Leu Thr Gly Thr Lys Arg Thr Ser Pro Gly Val Lys Ile Ala Pro 
470 475 480 

TTC AGA ATC CTA GAA GAG CAT CAA AGT AGA CTA AAA GCG CTT TCT AAA 4470 
Phe Arg Ile Leu Glu Glu His Gln Ser Arg Leu Lys Ala Leu Ser Lys 
485 490 495 

ACC G GTATGGATTC TCACCCACTT CATCGGACTC CTTATCACAA AAAACAAAAC 4524 
Thr 
500 

TAAATGATCT TTAAACATGG TTTTGTTACT TGCTGTCTGA CCTTGTTTTT TTTATCATCA 4584 

G TG GAA CTC GGG AAA CGA TTC TTC CCG CGC TGT TCG GCA GTG CTC 4629 
Val Glu Leu Gly Lys Arg Phe Phe Pro Arg Cys Ser Ala Val Leu 
505 510 515 

GAC CAG ATT ATG AAC TGT GAG GAC TTG ACT CAA CTG GCT TGC GGA GAA 4677 
Asp Gln Ile Met Asn Cys Glu Asp Leu Thr Gln Leu Ala Cys Gly Glu 
520 525 530 

GAC GAC ACT GCT GAG AAA CGA CTA CAA AAG AAG CAA AGG TAC ATG GAA 4725 
Asp Asp Thr Ala Glu Lys Arg Leu Gln Lys Lys Gln Arg Tyr Met Glu 
535 540 545 

ATA CAA GAG ACA CTA AAG AAG GCC TTT AGT GAG GAC AAT TTG GAA TTA 4773 
Ile Gln Glu Thr Leu Lys Lys Ala Phe Ser Glu Asp Asn Leu Glu Leu 
550 555 560 

GGA AAT TCG TCC CTG ACA GAT TCG ACT TCT TCC ACA TCG AAA TCA ACC 4821 
Gly Asn Ser Ser Leu Thr Asp Ser Thr Ser Ser Thr Ser Lys Ser Thr 
565 570 575 

GGT GGA AAG AGG TCT AAC CGT AAA CTC TCT CAT CGT CGT CGG TGA 4866 
Gly Gly Lys Arg Ser Asn Arg Lys Leu Ser His Arg Arg Arg * 
580 585 590 

GACTCTTGCC TCTTAGTGTA ATTTTTGCTG TACCATATAA TTCTGTTTTC ATGATGACTG 4926 
TAACTGTTTA TGTCTATCGT TGGCGTCATA TAGTTTCGCT CTTCGTTTTG CATCCTGTGT 4986 
ATTATTGCTG CAGGTGTGCT TCAAACAAAT GTTGTAACAA TTTGAACCAA TGGTATACAG 5046 
ATTTGTAATA TATATTTATG TACATCAACA ATAACCCATG ATGGTGTTAC AGAGTTGCTA 5106 
GAATCAAAGT GTGAAATAAT GTCAAATTGT TCATCTGTTG GATATTTTCC ACCAAGAACC 5166 
AAAAGAATAT TCAAGTTCCC TGAACTTCTG GCAACATTCA TGTTATATGT ATCTTCCTAA 5226 
TTCTTCCTTT AACCTTTTGT AACTCGAATT ACACAGCAAG TTAGTTTCAG GTCTAGAGAT 5286 
AAGAGAACAC TGAGTGGGCG TGTAAGGTGC ATTCTCCTAG TCAGCTCCAT TCCATCCAAC 5346 
ATTTGTGAAT GACACAAGTT AACAATCCTT TGCACCATTT CTGGGTGCAT ACATGGAAAC 5406 
TTCTTCGATT GAAACTTCCC ACATGTGCAG GTGCGTTCGC TGTCACTCAT AGACCAAGAG 5466 
ACTGAAAGCT TTCACAAATT GCCCTCAAAT CTTCTGTTTC TATCGTCATG ACTCCATATC 5526 
TCCGACCACT GGTCATGAGC CAGAGCCCAC TGATTTTGAG GGAATTGGGC TAACCATTTC 5586 
CGAGCTTCTG AGTCCTTCTT TTTGATGTCC TTTATGTAGG AATCAAATTC TTCCTTCTGA 5646 
CTTGTGGAT 5655 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 594 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Asp Thr Thr Ile Asp Gly Phe Ala Asp Ser Tyr Glu Ile Ser Ser 
1 5 10 15 

Thr Ser Phe Val Ala Thr Asp Asn Thr Asp Ser Ser Ile Val Tyr Leu 
20 25 30 

Ala Ala Glu Gln Val Leu Thr Gly Pro Asp Val Ser Ala Leu Gln Leu 
35 40 45 

Leu Ser Asn Ser Phe Glu Ser Val Phe Asp Ser Pro Asp Asp Phe Tyr 
50 55 60 

Ser Asp Ala Lys Leu Val Leu Ser Asp Gly Arg Glu Val Ser Phe His 
65 70 75 80 

Arg Cys Val Leu Ser Ala Arg Ser Ser Phe Phe Lys Ser Ala Leu Ala 
85 90 95 

Ala Ala Lys Lys Glu Lys Asp Ser Asn Asn Thr Ala Ala Val Lys Leu 
100 105 110 

Glu Leu Lys Glu Ile Ala Lys Asp Tyr Glu Val Gly Phe Asp Ser Val 
115 120 125 

Val Thr Val Leu Ala Tyr Val Tyr Ser Ser Arg Val Arg Pro Pro Pro 
130 135 140 

Lys Gly Val Ser Glu Cys Ala Asp Glu Asn Cys Cys His Val Ala Cys 
145 150 155 160 

Arg Pro Ala Val Asp Phe Met Leu Glu Val Leu Tyr Leu Ala Phe Ile 
165 170 175 

Phe Lys Ile Pro Glu Leu Ile Thr Leu Tyr Gln Arg His Leu Leu Asp 
180 185 190 

Val Val Asp Lys Val Val Ile Glu Asp Thr Leu Val Ile Leu Lys Leu 
195 200 205 

Ala Asn Ile Cys Gly Lys Ala Cys Met Lys Leu Leu Asp Arg Cys Lys 
210 215 220 

Glu Ile Ile Val Lys Ser Asn Val Asp Met Val Ser Leu Glu Lys Ser 
225 230 235 240 

Leu Pro Glu Glu Leu Val Lys Glu Ile Ile Asp Arg Arg Lys Glu Leu 
245 250 255 

Gly Leu Glu Val Pro Lys Val Lys Lys His Val Ser Asn Val His Lys 
260 265 270 

Ala Leu Asp Ser Asp Asp Ile Glu Leu Val Lys Leu Leu Leu Lys Glu 
275 280 285 

Asp His Thr Asn Leu Asp Asp Ala Cys Ala Leu His Phe Ala Val Ala 
290 295 300 

Tyr Cys Asn Val Lys Thr Ala Thr Asp Leu Leu Lys Leu Asp Leu Ala 
305 310 315 320 

Asp Val Asn His Arg Asn Pro Arg Gly Tyr Thr Val Leu His Val Ala 
325 330 335 

Ala Met Arg Lys Glu Pro Gln Leu Ile Leu Ser Leu Leu Glu Lys Gly 
340 345 350 

Ala Ser Ala Ser Glu Ala Thr Leu Glu Gly Arg Thr Ala Leu Met Ile 
355 360 365 

Ala Lys Gln Ala Thr Met Ala Val Glu Cys Asn Asn Ile Pro Glu Gln 
370 375 380 

Cys Lys His Ser Leu Lys Gly Arg Leu Cys Val Glu Ile Leu Glu Gln 
385 390 395 400 

Glu Asp Lys Arg Glu Gln Ile Pro Arg Asp Val Pro Pro Ser Phe Ala 
405 410 415 

Val Ala Ala Asp Glu Leu Lys Met Thr Leu Leu Asp Leu Glu Asn Arg 
420 425 430 

Val Ala Leu Ala Gln Arg Leu Phe Pro Thr Glu Ala Gln Ala Ala Met 
435 440 445 

Glu Ile Ala Glu Met Lys Gly Thr Cys Glu Phe Ile Val Thr Ser Leu 
450 455 460 

Glu Pro Asp Arg Leu Thr Gly Thr Lys Arg Thr Ser Pro Gly Val Lys 
465 470 475 480 

Ile Ala Pro Phe Arg Ile Leu Glu Glu His Gln Ser Arg Leu Lys Ala 
485 490 495 

Leu Ser Lys Thr Val Glu Leu Gly Lys Arg Phe Phe Pro Arg Cys Ser 
500 505 510 

Ala Val Leu Asp Gln Ile Met Asn Cys Glu Asp Leu Thr Gln Leu Ala 
515 520 525 

Cys Gly Glu Asp Asp Thr Ala Glu Lys Arg Leu Gln Lys Lys Gln Arg 
530 535 540 

Tyr Met Glu Ile Gln Glu Thr Leu Lys Lys Ala Phe Ser Glu Asp Asn 
545 550 555 560 

Leu Glu Leu Gly Asn Ser Ser Leu Thr Asp Ser Thr Ser Ser Thr Ser 
565 570 575 

Lys Ser Thr Gly Gly Lys Arg Ser Asn Arg Lys Leu Ser His Arg Arg 
580 585 590 

Arg * 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ile Arg Arg Met Arg Arg Ala Leu Asp Ala Ala Asp Ile Glu Leu Val 
1 5 10 15 

Lys Leu Met Val Met Gly Glu Gly Leu Asp Leu Asp Asp Ala Leu Ala 
20 25 30 

Val His Tyr Ala Val Gln His Cys Asn 
35 40 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Pro Thr Gly Lys Thr Ala Leu His Leu Ala Ala Glu Met Val Ser Pro 
1 5 10 15 

Asp Met Val Ser Val Leu Leu Asp His His Ala Asp Xaa Asn Phe Arg 
20 25 30 

Thr Xaa Asp Gly Val Thr 
35 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ile Arg Arg Met Arg Arg Ala Leu Asp Ala Ala Asp Ile Glu Leu Val 
1 5 10 15 

Lys Leu Met Val Met Gly Glu Gly Leu Asp Leu Asp Asp Ala Leu Ala 
20 25 30 

Val His Tyr Ala Val Gln His Cys Asn 
35 40 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Arg Arg Pro Asp Ser Lys Thr Ala Leu His Leu Ala Ala Glu Met Val 
1 5 10 15 

Ser Pro Asp Met Val Ser Val Leu Leu Asp Gln 
20 25 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Ile Arg Arg Met Arg Arg Ala Leu Asp Ala Ala Asp Ile Glu Leu Val 
1 5 10 15 

Lys Leu Met Val Met Gly Glu Gly Leu Asp Leu Asp Asp Ala Leu Ala 
20 25 30 

Val His Tyr Ala Val Gln His Cys Asn 
35 40 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Arg Arg Pro Asp Ser Lys Thr Ala Leu His Leu Ala Ala Glu Met Val 
1 5 10 15 

Ser Pro Asp Met Val Ser Val Leu Leu Asp Gln 
20 25 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ile Arg Arg Met Arg Arg Ala Leu Asp Ala Ala Asp Ile Glu Leu Val 
1 5 10 15 

Lys Leu Met Val Met Gly Glu Gly Leu Asp Leu Asp Asp Ala Leu Ala 
20 25 30 

Val His Tyr Ala Val Gln His Cys Asn 
35 40 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Pro Thr Gly Lys Thr Ala Leu His Leu Ala Ala Glu Met Val Ser Pro 
1 5 10 15 

Asp Met Val 
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Appiicant's or agenl*s file 

reference number > ^-21212/A/CGC 1909 



'"""""""'^''^'""'^"'^ PCT/EP9 7 / 0 1|2 18 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule Ubis) 



A. Tbc indicaiions made below relate lo the microorganism referred lo in the descripii< 



B. IDENTIFICATION OF DEPOSIT 



Further dcposkis are identified on an additional sheet j | 



Name of depositary insiuution 



Anerican Type Culture Collection (ATCC) 



Address of deposicary insiituiion (including pos^ai code and coumry) 

12301 Parklawn Drive 
Roclcville, MD 20852 
USA 



Date of deposit 



25 SeptMber 1996 (25.09.96) 



Accession Number 



97736 



C, ADDITIONAL INDICATIONS (/i:a>^6iani^ym^>;y|ic«6fc) TTiis information is continued on an additional sheet Q 



request the Expert Solution where available 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (iftkeindicau<nuarcnoif6r^ildcngnaiedSiMles^ 



SEPARATE FURNISHING OF INDICATIONS Ucavebhnk ifnd MppOc^hie) 



For raoeivmg QfCoB iM only 



Fw? * Hiissbeot was noeivBd with the 



Autbocized officer 



Fcm rcr/ROia34 (Jtily 199Z) 



For bitmatkmal Bttraan weoaly 



[j| Thh tbcet WM leoeived by the lotmatkmal Boma « 



Authorized officer

12301 Parklawn Drive • Rockville, MD 20852 USA

BUDAPEST TREATY ON THE INTERNATIONAL RECOGNITION OF
THE DEPOSIT OF MICROORGANISMS FOR THE PURPOSES OF PATENT PROCEDURE

INTERNATIONAL FORM

RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT ISSUED PURSUANT TO RULE 7.3
AND VIABILITY STATEMENT ISSUED PURSUANT TO RULE 10.2

To: (Name and Address of Depositor or Attorney)

Ciba-Geigy Corporation
Attention: Leslie B. Friedrich
P.O. Box 12257
Research Triangle Park, NC 27709

Deposited on Behalf of: Ciba-Geigy Corporation

Identification Reference by Depositor: Cosmid, D7
ATCC Designation: 97736

The deposit was accompanied by: _ a scientific description _ a proposed taxonomic description indicated above.

The deposit was received September 25, 1996 by this International Depository Authority and has been accepted.

AT YOUR REQUEST:
X We will inform you of requests for the strain for 30 years.

The strain will be made available if a patent office signatory to the Budapest Treaty certifies one's right to receive,
or if a U.S. Patent is issued citing the strain, and ATCC is instructed by the United States Patent & Trademark
Office or the depositor to release said strain.

If the culture should die or be destroyed during the effective term of the deposit, it shall be your responsibility to
replace it with living culture of the same.

The strain will be maintained for a period of at least 30 years after the date of deposit, and for a period of at least
five years after the most recent request for a sample. The United States and many other countries are signatory
to the Budapest Treaty.

The viability of the culture cited above was tested October 3, 1996. On that date, the culture was viable.
International Depository Authority: American Type Culture Collection, Rockville, Md. 20852 USA
Signature of person having authority to represent ATCC:

cc: Andrea C. Walsh, Ph.D.




Applicant's or agent's file reference number: PF/ -21212/A/CGC 1909
International application No.: PCT/EP97/01218

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page , line

B. IDENTIFICATION OF DEPOSIT

Further deposits are identified on an additional sheet [ ]

Name of depositary institution

American Type Culture Collection (ATCC)

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, MD 20852
USA

Date of deposit

13 June 1996 (13.06.96)

Accession Number

97606

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [ ]

We request the Expert Solution where available

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Y. Marinus-v.d. Nouweland

Authorized officer

Form PCT/RO/134 (July 1992)

For International Bureau use only

[ ] This sheet was received by the International Bureau on:


Authorized officer

American Type Culture Collection

12301 Parklawn Drive • Rockville, MD 20852 USA

BUDAPEST TREATY ON THE INTERNATIONAL RECOGNITION OF
THE DEPOSIT OF MICROORGANISMS FOR THE PURPOSES OF PATENT PROCEDURE

INTERNATIONAL FORM

RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT ISSUED PURSUANT TO RULE 7.3
AND VIABILITY STATEMENT ISSUED PURSUANT TO RULE 10.2

To: (Name and Address of Depositor or Attorney)

Ciba-Geigy Corporation

RECEIVED

Deposited on Behalf of: Ciba-Geigy Corporation

Identification Reference by Depositor: Plasmid P1-18
ATCC Designation: 97606

The deposit was accompanied by: _ a scientific description _ a proposed taxonomic description indicated above.

The deposit was received June 13, 1996 by this International Depository Authority and has been accepted.

AT YOUR REQUEST: We will inform you of requests for the strain for 30 years.

The strain will be made available if a patent office signatory to the Budapest Treaty certifies one's right to
receive, or if a U.S. Patent is issued citing the strain, and ATCC is instructed by the United States Patent &
Trademark Office or the depositor to release said strain.

If the culture should die or be destroyed during the effective term of the deposit, it shall be your responsibility
to replace it with living culture of the same.

The strain will be maintained for a period of at least 30 years from date of deposit, or five years after the most
recent request for a sample, whichever is longer. The United States and many other countries are signatory
to the Budapest Treaty.

The viability of the culture cited above was tested June ?Q, 1996. On that date, the culture was viable.
International Depository Authority: American Type Culture Collection, Rockville, Md. 20852 USA
Signature of person having authority to represent ATCC:

cc:


Andrea C. Walsh, Ph.D.

Applicant's or agent's file reference number: PF/ -21212/A/CGC 1909
International application No.: PCT/EP97/01218

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page , line

B. IDENTIFICATION OF DEPOSIT

Further deposits are identified on an additional sheet [ ]

Name of depositary institution

American Type Culture Collection (ATCC)

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, MD 20852
USA

Date of deposit

08 May 1996 (08.05.96)

Accession Number

97543

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [ ]

We request the Expert Solution where available

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Y. Marinus-v.d. Nouweland

For International Bureau use only

This sheet was received by the International Bureau on:

Authorized officer


Form PCT/RO/134 (July 1992)

American Type Culture Collection

12301 Parklawn Drive • Rockville, MD 20852 USA

BUDAPEST TREATY ON THE INTERNATIONAL RECOGNITION OF
THE DEPOSIT OF MICROORGANISMS FOR THE PURPOSES OF PATENT PROCEDURE

INTERNATIONAL FORM

RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT ISSUED PURSUANT TO RULE 7.3
AND VIABILITY STATEMENT ISSUED PURSUANT TO RULE 10.2

To: (Name and Address of Depositor or Attorney)

Ciba-Geigy Corporation
Attn: Leslie B. Friedrich
P.O. Box 12257
Research Triangle Park, NC 27709

RECEIVED MAY 28 1996
CIBA-GEIGY ABRU PATENT DEPT

Deposited on Behalf of: Ciba-Geigy Corporation

Identification Reference by Depositor: Plasmid, BAC-04
ATCC Designation: 97543

The deposit was accompanied by: _ a scientific description _ a proposed taxonomic description indicated above.

The deposit was received May 8, 1996 by this International Depository Authority and has been accepted.

AT YOUR REQUEST: We will inform you of requests for the strain for 30 years.

The strain will be made available if a patent office signatory to the Budapest Treaty certifies one's right to
receive, or if a U.S. Patent is issued citing the strain, and ATCC is instructed by the United States Patent &
Trademark Office or the depositor to release said strain.

If the culture should die or be destroyed during the effective term of the deposit, it shall be your responsibility
to replace it with living culture of the same.

The strain will be maintained for a period of at least 30 years from date of deposit, or five years after the most
recent request for a sample, whichever is longer. The United States and many other countries are signatory
to the Budapest Treaty.

The viability of the culture cited above was tested May 17, 1996. On that date, the culture was viable.
International Depository Authority: American Type Culture Collection, Rockville, Md. 20852 USA
Signature of person having authority to represent ATCC:
Barbara M. Hailey, Administrator, Patent Depository
Date: May 20, 1996

cc: Andrea C. Walsh, Ph.D.
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What Is Claimed Is: 

1. An isolated DNA molecule comprising a NIM1 gene. 

2. An isolated DNA molecule according to claim 1, comprising the nucleotide sequence set
forth in SEQ ID NO:2.

3. An isolated DNA molecule of about 9.9kb which encodes the NIM1 gene product. 

4. An isolated DNA molecule according to claim 1, comprising the nucleotide sequence set 
forth in SEQ ID NO:1. 

5. An isolated DNA molecule of claim 1, encoding the amino acid sequence of the NIM1
gene product set forth in SEQ ID NO:2.

6. An isolated DNA molecule comprising a mutant gene of NIM1 of claim 1, which is a nim1
gene.

7. Clone BAC-04. ATCC Deposit No. 97543. 

8. A chimeric gene comprising a promoter active in plants operably linked to a heterologous
DNA molecule encoding the amino acid sequence of a NIM1 gene product.

9. A chimeric gene comprising a promoter active in plants operably linked to the
heterologous DNA fragment according to claim 3.

10. A chimeric gene comprising a promoter active in plants operably linked to a
heterologous DNA molecule encoding the amino acid sequence set forth in SEQ ID NO:2.

11. A chimeric gene comprising a promoter active in plants operably linked to a
heterologous DNA molecule encoding the amino acid sequence of a nim1 gene product.

12. A recombinant vector comprising the chimeric gene of any one of claims 8 to 11.

13. A recombinant vector according to claim 12, wherein said vector is capable of being
stably transformed into a host cell.

14. A recombinant vector according to claim 12, wherein said vector is capable of being
stably transformed into a plant, plant seeds, plant tissue or plant cell.

15. A plant expression cassette comprising a chimeric gene of any one of claims 8 to 11.

16. A plant expression cassette comprising a chimeric gene of any one of claims 8 to 10.

17. A plant expression cassette comprising a chimeric gene of claim 11.

18. A plant expression cassette according to any one of claims 15 to 17 which expresses the
chimeric gene continuously or constitutively.

19. A plant, plant cells and the progeny thereof comprising the chimeric gene of any one of
claims 8 to 11.

20. A plant, plant cells and the progeny thereof comprising the chimeric gene of any one of
claims 8 to 10, which have a broad spectrum of disease resistance.

21. A plant, plant cells and the progeny thereof comprising the chimeric gene of claim 11.

22. A plant, plant cells and the progeny thereof of claim 19, wherein said plant is selected
from the group consisting of gymnosperms, monocots, and dicots.

23. A plant, plant cells and the progeny thereof of claim 19, wherein said plant is a crop
plant.

24. A plant, plant cells and the progeny thereof of claim 23, wherein said plant is selected
from the group consisting of rice, wheat, barley, rye, corn, potato, carrot, sweet potato,
sugar beet, bean, pea, chicory, lettuce, cabbage, cauliflower, broccoli, turnip, radish,
spinach, asparagus, onion, garlic, eggplant, pepper, celery, carrot, squash, pumpkin,
zucchini, cucumber, apple, pear, quince, melon, plum, cherry, peach, nectarine, apricot,
strawberry, grape, raspberry, blackberry, pineapple, avocado, papaya, mango, banana,
soybean, tobacco, tomato, sorghum and sugarcane.

25. Use of an isolated DNA molecule according to claim 1 to confer disease
resistance in plants.

26. Use of an isolated DNA molecule according to claim 1 and variants thereof in a
screening method for identifying compounds capable of inducing broad spectrum disease
resistance in plants.

27. Use of a plant phenotype according to claim 17 to identify an isolated gene fragment
which allows expression of a broad spectrum of disease resistance in plants.

28. Use of an isolated DNA molecule according to any one of claims 1 to 5 to confer disease
resistance to plant cells, plants and the progeny thereof.

29. Use of an isolated DNA molecule according to claim 6 to confer universal disease
susceptibility to plant cells, plants and the progeny thereof.

30. Use of resistant plants and the progeny thereof according to claim 20 to incorporate the
resistant trait into plant lines through breeding.
[Figure: genetic and physical map of the Nim region showing the AFLP markers and the flanking YACs CIC12H07, CIC12F04, CIC7E03 and CIC10G07]

Schematic representation of the Nim flanking YACs with regard to the genetic map and the
AFLP markers. Below the physical distance is indicated in n x 100 kb. Only the positions of
markers W84.2 and WL84.4&5 were determined very accurately, the other markers can only be
positioned within a certain interval.

[Figure: P1 and BAC clones positioned relative to YACs CIC12H07 and CIC12F04 and the flanking AFLP markers]

Positions of P1 and BAC clones with respect to the flanking AFLP markers and YACs. The
positions of clones P1-3 and P1-4 with respect to YACs 10G07 and 7E03 were not determined.
At the bottom the physical distance in kb is indicated. The physical distance and extent of
overlaps are best estimates and not exact values.

Extended P1 and BAC contig covering the South end of YAC CIC12F04 and the flanking
markers. Clones P1-3, P1-4 and P1-9 overlapped completely with YACs 10G07 and 7E03 and
were not positioned with respect to these YACs. At the bottom the physical distance in kb is
indicated. The physical distance and extent of overlaps are best estimates and not exact values.

Schematic representation of all identified P1 and BAC clones and their relative positions in the
Nim region. At the bottom the physical distance in kb is indicated. The physical distance and
extent of overlaps are best estimates and not exact values.

Integrated genetic and physical fine map of the NIM region. The scale on the bottom is in
kb.

Integrated map of the NIM region. This map is a best fit of the AFLP fingerprinting and
restriction mapping data. Shown are the relevant AFLP markers, the BAC and P1 clones,
the cosmid contig, the sizes of the cosmid inserts (in parentheses) and some restriction
sites. The scale on the bottom is in kb.

Integrated map of the NIM region including the new AFLP markers. This map is a best fit of
the AFLP fingerprinting and restriction mapping data. Shown are the relevant AFLP
markers, the BAC and P1 clones, the cosmid contig, the sizes of the cosmid inserts (in
parentheses) and some restriction sites. The scale on the bottom is in kb.

Schematic representation of the genotypes of the most important recombinants D169 and
C105. Shown is the recombinant chromosome with the Ler portion indicated as a black bar
and the Ws portion as a gray bar (the non-recombinant chromosome is of the Ws-type). On
top the positions and approximate distances of the relevant AFLP markers are shown
related to the actual size in kb as shown at the bottom.

1 tgatcatgaa ttgcgtgtag ggttgtgttt taaagatagg gatgagctga

51 agaaggcggt ggactggtgt tccattagag ggcagcaaaa gtgtgtagta

101 caagagattg agaaggacga gtatacgttt aaatgcatca gatggaaatg

151 caattggtcg cgtcgggcag attgaataga agaacatgga cttgttaaga

201 taactaagtg tagttggtcc acatacttgt tgttctatta agccggaaaa

251 cttcaacttg taatttgcag cagaagagat tgagtgtctg atcagggtac

301 aacccactct aacagcagag ttgaaaagtt tggtgacatg cttaaaactt

351 caaagctgcg ggcagcagaa caggaagtaa tcaaagatca gagtttcaga

401 gtattgccta aactaattgg ctgcatttca ctcatctaat gggctacttg

451 tggactgcaa tatgagcttt tccctaatcc tgaatttgca tccttcggtg

501 gcgcgttttg ggcgtttcca cagtccattg aagggtttca acactgtaga

551 cctctgatca tagtggattc aaaagacttg aacggcaagt accctatgaa

601 attgatgatt tcctcaggac tcgacgctga tgattgcttt ttcccgcttg

651 cctttccgct taccaaagaa gtgtccactg atagttggcg ttggtttctc

701 actaatatca gagagaaggt aacacaaagg aaagacgttt gcctcgtctc

751 cagtcctcac ccggacatag ttgctgttat taacgaaccc ggatcactgt

801 ggcaagaacc ttgggtctat cacaggttct gtctggattg tttttgctta

851 caattccatg atatttttgg agactacaac ctggtgagcc ttgtgaagca

901 ggctggatcc acaagtcaga aggaagaatt tgattcctac ataaaggaca

951 tcaaaaagaa ggactcagaa gctcggaaat ggttagccca attccctcaa

1001 aatcagtggg ctctggctca tgaccagtgg tcggagatat ggagtcatga

1051 cgatagaaac agaagatttg agggcaattt gtgaaagctt tcagtctctt

1101 ggtctatcag tgacagcgaa cgcacctgca catgtgggaa gtttcaatcg

1151 aagaagtttc catgtatgca cccagaaatg gtgcaaagga ttgttaactt

1201 gtgtcattca caaatgttgg atgcaatgga gctgactagg agaatgcacc

1251 ttacacgccc actcagtgtt ctcttatctc tagacctgaa actaacttgc

1301 tgtgtaattc gagttacaaa aggttaaagg aagaattagg aagatacata

1351 taacatgaat gttgccagaa gttcagggaa cttgaatatt cttttggttc

1401 ttggtggaaa atatccaaca gatgaacaat ttgacattat ttcacacttt

1451 gattctagca actctgtaac accatcatgg gttattgttg atgtacataa

1501 atatatatta caaatctgta taccattggt tcaaattgtt acaacatttg

1551 tttgaagcac acctgcagca ataatacaca ggatgcaaaa cgaagagcga

1601 aactatatga cgccaacgat agacataaac agttacagtc atcatgaaaa

1651 cagaattata tggtacagca aaaattacac taagaggcaa gagtctcacc

1701 gacgacgatg agagagttta cggttagacc tctttccacc ggttgatttc

1751 gatgtggaag aagtcgaatc tgtcagggac gaatttccta attccaaatt

1801 gtcctcacta aaggccttct ttagtgtctc ttgtatttcc atgtaccttt

1851 gcttcttttg tagtcgtttc tcagcagtgt cgtcttctcc gcaagccagt

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1901 tgagtcaagt cctcacagtt cataatctgg tcgagcactg ccgaacagcg 

1951 cgggaagaat cgtttcccga gttccactga tgataaaaaa aacaaggtca 

2001 gacagcaagt aacaaaacca tgtttaaaga tcatttagtt ttgttttttg 

2051 tgataaggag tccgatgaag tgggtgagaa tccataccgg ttttagaaag 

2101 cgcttttagt ctactttgat gctcttctag gattctgaaa ggtgctatct 

2151 ttacacccgg tgatgttctc ttcgtaccag tgagacggtc aggctcgagg 

2201 ctagtcacta tgaactcaca tgttcccttc atttcggcga tctccattgc 

2251 agcttgtgct tccgttggaa aaagacgttg agcaagtgca actaaacagt 

2301 ggacgacaca aagaatagtt atcattagtt cactcagttt cctaatagag 

2351 aggacataaa tttaattcaa acatataaga aataagactt gatagatacc 

2401 tctattttca agatcgagca gcgtcatctt caattcatcg gccgccactg 

2451 caaaagaggg aggaacatct ctaggaattt gttctcgttt gtcttcttgc 

2501 tctagtattt ctacacatag tcggcctttg agagaatgct tgcattgctc 

2551 cgggatatta ttacattcaa ccgccatagt ggcttgtttt gcgatcatga 

2601 gtgcggttct accttccaaa gttgcttctg atgcacttgc acctttttcc 

2651 aatagagata gtatcaattg tggctccttc cgcatcgcag caacatgaag 

2701 caccgtatat cccctcggat tcctatggtt gacatcggca agatcaagtt 

2751 ttaaaagatc tgttgcggtc ttcacattgc aatatgcaac agcgaaatga 

2801 agagcacacg catcatctag attggtgtga tcctctttca aaagcaactt 

2851 gactaactca atatcatccg agtcaagtgc cttatgtaca ttcgagacat 

2901 gtttctttac tttaggtacc tccaaaccaa gctctttacg tctatcaatt 

2951 atctctttaa caagctcttc cggcaatgac ttttcaagac taaccatatc 

3001 tacattagac ttgacaataa tctctttaca tctatccaat agcttcatac 

3051 aagctttacc acatatatta gcaagcttga gtataaccaa tgtgtcctct 

3101 ataacaactt tgtctacaac gtccaataag tgcctctgaa atacaaatac 

3151 aagtactcaa gtaagaacat attcatgaat gtgtaaccat agcttaatgc 

3201 agatggtgtt ttacctgata gagagtaatt aattcaggga tcttgaagat 

3251 gaaagccaaa tagagaacct ccaacatgaa atccaccgcc ggccggcaag 

3301 ccacgtggca gcaattctcg tctgcgcatt cagaaactcc tttaggcggc 

3351 ggtctcactc tgctgctgta aacataagcc aaaacagtca caaccgaatc 

3401 gaaaccgact tcgtaatcct tggcaatctc cttaagctcg agcttcacgg 

3451 cggcggtgtt gttggagtct ttctccttct tagcggcggc taaagcgctc 

3501 ttgaagaaag agcttctcgc tgacaaaacg caccggtgga aagaaacttc 

3551 ccggccgtcg gagagaacaa gcttagcgtc gctgtagaaa tcatccggcg 

3601 agtcaaagac ggattcgaag ctgttggaga gcaattgcag agcagataca 

3651 tcaggtccgg tgagtacttg ttcggcggcc agataaacaa tagaggagtc 

3701 ggtgttatcg gtagcgacga aactagtgct gctgatttca taagaatcgg 

3751 cgaatccatc aatggtggtg tccatcaaca ggttccgatg aattgaaatt 

3801 cacaaattaa agagatctct gctaatcaac gaagagacct tatcaactgg 
3851 atttggttaa agatcgaaga taaccattga cgagcagagc caagtcaagt

3901 caacgagagt ggtggtgaga tatgaagaag catcctcgtc ccacggttta

3951 catttcacca aaaccggtaa atttccagga aaggaatctt tgtcagagat

4001 cttttttaaa aagatataac aggaagctaa accggttcgg gttataaatg

4051 ttagtattta taccggagac attttgtgtt gctaattttt gtatatgaga

4101 agttcaatcc ggttcggtaa gcccctgaac caaactagat ttggagatga

4151 tataaatata taaaatttat ttttcatccg gttcgttatt ttcatataaa

4201 tatataaata ttatttttta aatttaagaa ttagatttac atgtgaaagt

4251 tacatttctg tttattttct ttgaagtaaa atgataaagg gaacgtatat

4301 taagtttcat gctttattca cataagtttt gtaatgtata ttatattttt

4351 cgtttattga aaaagtaatt ttcagtgttc agcatgttta cactataatt

4401 aaatcaagtc gaatatttcc tggaactatt ctccttgttc tatagcaaat

4451 gaaaacgctc ttcacaacaa aatcattata gatataggaa taaattacat

4501 taaaaacatg aaagtcataa tgaatatatt tttttaatta ggatttgatt

4551 taaaaacaat tattgtatac atataaaaga cttctttagt tatttgcctt

4601 caacttctcg ttctgaatca tgcgataaat cagctttttc aataactacg

4651 acgtaaaagc aaattcataa cacgtctaaa caaatttggc tcatccttca

4701 cttgattggt gttttccgga ctcgatgttg ctggaaactg agaagaagaa

4751 ggaatctgca taatcacctc ttggttcctc accggtagac tcattttgtt

4801 ggatcgaaaa cgatcgagat cagaaaatga aaagataggt taaagatgcc

4851 tatgaataca acaacgtaag attatgttga ataaacagag tactttatat

4901 aggagttata ataaggtaaa taaattattg ctttccgcgt tttttacttt

4951 tgtatttctt aaatgataag ttaaattagg ataagatttg tatgatttta

5001 agtaaattta caataactct ctataactca atagcatcac atatttaatt

5051 aattttacta attatctttt gaacaatttt atgaaatagt tttcttttaa

5101 ttaatttttt aaaatgatat attataaaat ttaattgaat caatctgata

5151 taattttttt atcttctacc atctattata gttgataaat attgtgataa

5201 actttagata aacacccaat tgccaaatat ttaataaatt ttgtgtacca

5251 tgcgtttttt ttggagaata tatatacgtg gacagcatac cgtacatata

5301 ttgtataaaa gcttataaaa catagatacg ggttatattg gtaagctata

5351 aatatatgta aacaatagta agatattacg tgttgtgtct aaatatgtgt

5401 tgctttagat attatgtata tctaatatat taaaatatct tttattaact

5451 aatatattat ttaagagaga aaattgggac actattttct atacagtaac

5501 tgttttcaac tataaacagg aacccttgat ataataaaat aactagccaa

5551 aaaatcagat taaatattca taaaacaatg tttggtatta ttacataaac

5601 ctaagaaaca aaattcaata ttccttttta ccttataaaa aacaattaaa

5651 catcactaga tatatttatg ccccacaatg agcgagccaa ttgagacttg

5701 agacttgaga tccttgtcaa ctacgtttgc atttgtcggc ccattttttt

5751 tatttttttt ttaaagtgtc ggcccgttgc ttcttccgtt cagatcaacc






5801 ctctcgtaat cagaacaaaa cggaaaacaa acgaaagaac aatcagatcc

5851 ctcttttttt gcataaacta aattcaactt ctctgcgttt atgttgtaga

5901 ggcaaccacg atcactacta cgaaacaata caacgtcgtt gcttggagtc

5951 cacgtaatca aatctactcc aatgctttta atatctttca ctttaaccca

6001 cgacttttca aaactgctct ttaaaaccca taactcgtga acatcttctt

6051 gatctttgtt tgtccactga cgaatagcac ctagcttccc ttcgtatctg

6101 actaatcctg agaaaacatc agagttcgga gtatggaaga aggaccaagt

6151 ttcggttttg agacaaaacc ggatcacatt gttgttccgt gatatccaat

6201 gcaagaaccc cgaaacttgt atcgggttgg aaaaaattaa tctgtctgtt

6251 tttggtagac gcaaattttc taatctcttc caggtaaacg aatcagaatc

6301 gaaaacttcg cacataaaag ttctgtgatt caaatggtag ataccccgag

6351 acatacacat acgccgagac tgcgaaagcc tttgtatttt ataccggaaa

6401 gggttcaatc cgattaccgc taaacccaat gacatatccc aacccttcac

6451 ttctggcttt ggtatgacct gatactgttt agtggttggt ttgaagacta

6501 tgtatccacg tgatggtttt gtatacttaa cacaaagcaa tatcccatga

6551 cttgcatcac aagcttcgat ctttatcatt ccgggtggca gaaagtcgat

6601 ggagactcca ttgttttgta aatcactcct ctcatggaca aaactggttc

6651 gaagttcgtg tccttttact atgtagtgtt gtatgaagta tcccgaaata

6701 cgattggttc taaggagatt aagattgaca aaccatgact cgtagcttct

6751 cttgttgcac tctttattca ggagcctgaa ttttccgatt tttgacgccg

6801 gaagataaga aagaaattct tggatcatgt cttgatttat caccggagaa

6851 ctcatgatcc tgtcgggaat aaagagatga gcacgatcac tgaatgagaa

6901 atgaaaaaat caggatcggt agagaacaac ttatgatgaa taaagtgttt

6951 atatatcctt tctttgttta aggaaagtat caaaatttgc ctttttcttc

7001 gctagtccta aaacaaacaa attaaccaaa agataaaatc tttcatgatt

7051 aatgttactt gtgatacctt aagccaaaac tttatcttta gacttttaac

7101 caaatctaca gtaatttaat tgctagactt aggaaacaac tttttttttt

7151 acccaacaat ctttggattt taattgtttt tttttctact aatagattaa

7201 caactcatta tataataatg tttctatcat aattgacaat tctttctttt

7251 taataaacat ccagcttgta taataatcca caagtcaatt tcaccatttt

7301 ggccaattta ttttcttata aaaattagca caaaaaagat tatcattgtt

7351 tagcagattt aatttctaat taacttacgt aatttccatt ttccatagat

7401 ttatctttct ttttatttcc ttagttatct tagtactttc ttagtttcct

7451 tagtaatttt aaattttaag ataatatatt gaaattaaaa gaagaaaaaa

7501 aactctagtt atacttttgt taaatgtttc atcacactaa ctaataattt

7551 tttttagtta aattacaata tataaacact gaagaaagtt tttggcccac

7601 acttttttgg gatcaattag tactatagtt aggggaagat tctgatttaa

7651 aggataccaa aaatgactag ttaggacatg aatgaaaact tataatctca

7701 ataacataca tacgtgttac tgaacaatag taacatctta cgtgttttgt

7751 ccatatattt gttgcttata aatatattca tataacaatg tttgcattaa

7801 gcttttaaga agcacaaaac catataacaa aattaaatat tcctatccct

7851 accaaaaaaa aaaattaaat attcctacag ccttgttgat tattttatgc

7901 cctacgttga gccttgttga ctagtttgca tttgtcggtc catttcttct

7951 tccgtccaga tcaaccctct cgtaatcaga acaaaagggg aaacaaacgt

8001 aagaggcaaa atccttgttt gtatgaacta agtttaactt ctctgtgttt

8051 aagttgtaga ggcaaacatg atcccaacta gaaagcatta cgacgtcgtt

8101 gcttggtatc cacgtaatat gctctactcc aatgctttca atatctttca

8151 ctttttccca cgacttttca aaactgctct ttaaaaccca taatctgtga

8201 acatcttctt gattgttgtt tatccagtga cgaataacac ctagcttccc

8251 ttcgtagctg actaactctg ggaataaacc aacgtttgga gtatgtaaga

8301 aagaccaagt ttcggttttg ggacataacc ggatcacatt gtggttccat

8351 gatctccaat gcaagaaccc tgaagcttgt accgggtttg aaagaattag

8401 accgtctgtt ctcggtagac gcaaattttt taatctcttc cacataaacg

8451 aatcggaatc aaaaacttcg cacgcaaaag ttctgagatt ccgagtcata

8501 ccaggcgatt tcgaaagcct aaatatttta taccggaaag gctgcaatcc

8551 ggttaccgtt agacctaatg acttatcaca actcctcact tttgggtttg

8601 gtatgatctg atactgtttt gttgttggtt tgcagactat gtattccggt

8651 attggtcttg tatcattata acaaagcaat atcccatgac gtgcatcaca

8701 agctttgatc tttacctctc cttgtggcag aaaatcgatg gagactcctt

8751 tgttatccaa atctctcctc tcatggaaaa aactggtatc aagtttgtat

8801 cctctttcgt agcgttctag gaagtatcca gagatattgt tggttcgatg

8851 gagatttagg ttgacaaacc aagactcgta gcttctcttg ttgcactctt

8901 tattgatgag cctcaatttt ccgatttcgg acccccgaag ataagaaaga

8951 acctcttgga tcgtgtcctg atttatcacc ggagaactca tgatcttatt

9001 ggaaaaaaga aagaaagaga tgagcacgat cagtgaatga gatatataga

9051 aatcaggafct ggtagagaac cgacgatgat gaatatacaa gtgtttataa

9101 gtatcacaaa ttgccttttt cttcgctagt cccaaaacaa gcaaattaac

9151 caaagataaa atcttcatta atgttttcct ttttcttcgc cagtcccaga

9201 taaaaatata tataaaatat ttcattaggt tacttgtagt accttgagcc

9251 caaagtttct cttttgactt ttaaccaaat taacagtaat ttaatagcta

9301 gacttagaaa acaacatttt gtatatatat tctttgacat caaaattcaa

9351 caatctttgg gtttctatag tgtttttttt cttattctaa tagattacca

9401 ctcattatat catatacaaa gtgtttcctt ttcaatcaac atccattttc

9451 tttaaaaatt agcaagtttg ttcttatatc atcattcagc agatttctta

9501 attaaactta gtgatttcca ttttgcacct atatgtttct ctttcttagt

9551 ttagtacttt aaattttcat atatataatt tattaaaatt aaaagtaaaa

9601 actccagttt aacttatgtt aaatgtttca tcacactaaa agagcattaa

9651 gtaataaata ttttagcttt atgaaaaaaa atatcaaatc actgaagaca

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9701 tttgttggcc tatactctat tttttatttg gccaattagt aatagactaa 

9751 tagtaactca tatgatatct ctctaattct ggcgaaacga atattctgat 

9801 tctaaagata gtaaaaatga attttgatga agggaatact atttcacaca 

9851 cctagaaaga gtaaggtaga aacctttttt tttttggtca gattcttgta 

9901 tcaagaagtt ctcatcgat 
nim   :  267 VSNVHKALDSDDIELVKLLLKEDHTNLDDACALHFAVAYCN 307
             +  + +ALD+ DIELVKL++  +  +LDDA A+H+AV +CN
Rice-1:   33 IRRMRRALDAADIELVKLMVMGEGLDLDDALAVHYAVQHCN 155

nim   :  327 PRGYTVLHVAAMRKEPQLILSLLEKGASASEATLEGRT 364
             P G T LH+AA    P ++  LL+  A  +  T +G T
Rice-1:  215 PTGKTALHLAAEMVSPMIVSVLLDHHADXNFRTXDGVT 328

nim   :  267 VSNVHKALDSDDIELVKLLLKEDHTNLDDACALHFAVAYCN 307
             +  + +ALD+ DIELVKL++  +  +LDDA A+H+AV +CN
Rice-2:   33 IRRMRRALDAADIELVKLMVMGEGLDLDDALAVHYAVQHCN 155

nim   :  325 RNPRGYTVLHVAAMRKEPQLILSLLEK 351
             R P   T LH+AA    P ++  LL++
Rice-2:  208 RRPDSKTALHLAAEMVSPMIVSVLLDQ 288

nim   :  267 VSNVHKALDSDDIELVKLLLKEDHTNLDDACALHFAVAYCN 307
             +  + +ALD+ DIELVKL++  +  +LDDA A+H+AV +CN
Rice-3:   33 IRRMRRALDAADIELVKLMVMGEGLDLDDALAVHYAVQHCN 155

nim   :  325 RNPRGYTVLHVAAMRKEPQLILSLLEK 351
             R P   T LH+AA    P ++  LL++
Rice-3:  208 RRPDSKTALHLAAEMVSPDMVSVLLDQ 288

nim   :  267 VSNVHKALDSDDIELVKLLLKEDHTNLDDACALHFAVAYCN 307
             +  + +ALD+ DIELVKL++  +  +LDDA A+H+AV +CN
Rice-4:   33 IRRMRRALDAADIELVKLMVMGEGLDLDDALAVHYAVQHCN 155

nim   :  327 PRGYTVLHVAAMRKEPQLI 345
             P G T LH+AA    P ++
Rice-4:  215 PTGKTALHLAAEMVSPDMV 271
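
The identical positions in the alignments above can be recounted with a short script. The following is a minimal sketch, not part of the original disclosure: the function name `identity_midline` and the constant names are hypothetical, and only exact matches are marked. The "+" symbols in the figure denote conservative substitutions, which would additionally require a scoring matrix and are omitted here. The two segments are the nim (267-307) and rice (33-155) sequences of the first alignment.

```python
from typing import Tuple


def identity_midline(query: str, subject: str) -> Tuple[int, str]:
    """Return (number of identical positions, identity-only midline).

    Both inputs must be ungapped aligned segments of equal length.
    The midline carries the residue letter where the two sequences
    agree and a space elsewhere (no '+' for conservative changes).
    """
    if len(query) != len(subject):
        raise ValueError("aligned segments must have equal length")
    midline = "".join(q if q == s else " " for q, s in zip(query, subject))
    identities = sum(1 for q, s in zip(query, subject) if q == s)
    return identities, midline


# Segments copied from the first nim / rice alignment block.
NIM_267_307 = "VSNVHKALDSDDIELVKLLLKEDHTNLDDACALHFAVAYCN"
RICE_33_155 = "IRRMRRALDAADIELVKLMVMGEGLDLDDALAVHYAVQHCN"

count, midline = identity_midline(NIM_267_307, RICE_33_155)
percent = 100.0 * count / len(NIM_267_307)

print(NIM_267_307)
print(midline)
print(RICE_33_155)
print(f"{count}/{len(NIM_267_307)} identical positions ({percent:.0f}%)")
```

For this segment the script reports 20 identical positions out of 41, which agrees with the letters shown on the middle line of the alignment.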
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